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Abstract 


The problem considered is that of recovering digital information stored on a magnetic disc after the disc 
has been erased. Even after erasure, the signal that represents the information is present on the disc, 
though at a very reduced level. Communication theory techniques are applied to detect the erased signal 
in the presence of noise. For this purpose, the signal and noise in the readback channel are characterized. 
The signal dependent and nonstationary nature of particulate recording media is investigated. The 
average power spectral density description is studied in detail, and its inadequacy in characterizing 
media noise is discussed. It is indicated that media noise is completely characterized stochastically by its 
autocorrelation function. A time-domain model for the noise is then proposed which makes it possible to 
determine the autocorrelation function of the noise, for any general signal written on the disc, from a 
simple set of spectrum analyzer measurements. The models for signal and noise are used to design 
optimal and suboptimal detection bit detection schemes. The probabilities of bit error that result when 
these schemes are employed are calculated numerically for four representative case studies. Conservative 


estimates for the probability of detecting digital information from erased magnetic discs are obtained. 
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Chapter 1 


Introduction 


Recovering digital information stored on a magnetic disc after the disc has been subjected to erasure is a 
problem of interest in magnetic recording. A specific instance of this problem, which is the motivation 
for the work discussed in this thesis, is the quantification of the security of information stored in the 
discs, in terms of the probability of retrieving the information after they have been erased. Even after 
the erasure, the signal that represents the information stored on the disc is present on the disc, though 
at a very reduced level. Noise, which comes from various sources in the recording system, hinders the 
accurate retrieval of the stored information, and more so when the disc has been erased. Based on the 
statistics of the noise, detection schemes can be designed to recover the stored information. The goal of 
this project is to predict the probability of recovering the erased data at various levels of erasure when 
these schemes are used for detection. For the specific problem of interest we need to obtain conservative 
estimates of ( or upper bounds on ) the probability of information retrieval. We have applied 
communication theory techniques to the problem of detecting the erased signal in presence of noise. For 
this purpose we consider the readback system as a communication channel over which the information 
stored in the disc, is transmitted. The first part of this project concentrates on characterizing the 
readback channel to the best possible extent. This characterization involves modeling the signal and 


noise in the readback channel. The second part applies these models to estimate the probability of 


accurate information retrieval. 


1.1. Modeling the Signal 


In digital magnetic recording, a "1" is represented by a transition in the direction of magnetization on 
the disc, and a "0" is represented by the absence of such a transition. We consider that an inductive 
head is used in the readback channel. Consequently, when a string of bits is stored on the disc, the 
readback voltage at the output of the read head is a sequence of positive and negative going pulses 


which we shall refer to as the signal. The presence of such a pulse within a clock period indicates a "1", 


and its absence indicates a "0". The shape of an individual pulse, to a very good approximation, is a 


Lorentzian (20]. We shall be using this Lorentzian pulse model in our analysis. 


1.2. Noise Models 


The sequence of pulses described in the last paragraph is not the only contribution to the readback 
voltage. There is also an undesirable noise component. As a broad classification, we can consider that 
the noise comes from two sources, the readback electronics and recording medium itself. The noise from 
the readback electronics can be very accurately modeled as white noise, and is not difficult to 
characterize. The noise from the recording medium, which we shall refer to as medta noise, is more 
interesting. We shall be concentrating only on particulate recording media in this thesis. The media 


noise in this case arises due to the randomness in the locations and orientations of the magnetic particles 


that constitute the medium. 


Previous theoretical and experimental work (18, 1, 16] has shown that the noise from particulate 
recording media consists of a signal independent background noise term and a signal dependent 
modulation noise term. This signal dependence causes the media noise to be statistically nonstationary. 
This nonstationarity nature is what makes the noise in magnetic recording systems different from the 
noise in conventional communication channels. One of the ways to partially characterize the noise in the 
stochastic sense is through the use of an average power spectral density description. In the past, models, 
based on the physics of the recording process, have been developed (12, 18, 1] for the average power 
spectrum of the noise when periodic signals are recorded on the disc. The emphasis in these models has 
been to evolve an exact formulation for the average power spectrum in terms of the various parameters 
associated with the recording system. Very little attempt has been made to characterize the noise for the 
purposes of designing bit detection schemes based on the statistics of the noise. Since this is our goal we 


have taken a different approach to characterize the noise. 


With the assumption that the noise is zero-mean Gaussian, it can be completely characterized by its 
two-dimensional autocorrelation function. We have proposed a model for the nonstationary media noise 
in terms of two stationary stochastic processes and one deterministic function of time. The deterministic 
function of time depends on the signal and it reflects the signal dependent nature of the media noise. 
The autocorrelation functions of each of the component stationary stochastic processes are determined 
from a set of spectrum analyzer measurements. These are then used to determine the two-dimensional 
autocorrelation function of the non-stationary media noise when a general deterministic signal is written 


on the medium. 


1.3. Bit Detection 


The existing schemes for bit detection, i.e., deciding whether a particular bit in a bit string is a "1" or a 
"0", include level detection and peak detection. In level detection, if the readback voltage at the center 
of the bit period is greater than a prespecified threshold we decide "1" is present, and otherwise we 
decide a “O" is present. This scheme is simplistic and results in very high ae rates. Peak detection is 
an improvement over level detection; in this scheme the readback voltage is first differentiated, and then 
the presence or absence of a pulse is established by detecting a zero crossing in the given bit period. 


Both these schemes are ad hoc, in the sense that their design does not directly make use of the statistics 


of the noise. 


The detection schemes that we shall discuss in this thesis are more sophisticated; their design is based on 
the stochastic characterization of the noise in terms of its autocorrelation function. We formulate the 
detection problem as a hypothesis testing problem and evolve a decision strategy for the optimal 
detector, i.e., the detector which yields the minimum probability of error. We also analyze the 
performance of some suboptimal detectors which are easier to implement than the optimal detector. As 
one would naturally expect the probability of error increases with the reduction in signal] level that is 
caused by erasure. We have computed the probability of error for all these detection schemes at various 
levels of erasure. For the present problem of estimating the probability of erased signal retrieval, we ( if 
at all ) must err on the conservative side. Thus, we must consider the optimal bit detection schemes 
yielding the highest bit detection probabilities even if it is impractical to implement these schemes. 


These can, in turn, be used to predict the probability of correctly retrieving a long sequence of bits. 


1.4. Prologue 


The organization of this thesis is as follows. Chapter 2 provides an introduction to the readback 
channel. A model for the signal pulse is derived in this chapter, and response of the channel to the 
magnetization of a single particle in the medium is analyzed. Starting with this single particle response, 
a model for the average power spectrum of the noise is developed in Chapter 3. Chapter 4 concerns the 
time-domain model for particulate media noise, which allows us to obtain the two-dimensional 
autocorrelation function of the media noise from simple spectrum analyzer measurements. The design 
and performance evaluation of bit detection schemes which are based on the noise statistics is discussed 
in Chapter 5. This chapter contains a major portion of the work done in this thesis. Conclusions and 


suggestions for continuing work in this area are presented in Chapter 6. 


Chapter 2 
The Readback Channel 


2.1. Introduction 


The use of magnetic recording has become widespread in the last two decades for numerous reasons. A 
discussion of these reasons and an excellent introduction to the various aspects of magnetic recording 
can be found in [12]. In all types of magnetic recorders, the information to be stored is applied as a 
time-varying current in the coil of a gapped-write head; the time-varying fringing magnetic field, 
emerging from the gap, magnetizes the magnetic medium which is moving past the head. The magnetic 
materials used in magnetic recording have properties similar to those of permanent magnets, such as 
high values of remanent magnetization and coercive force. Also the elementary particles that constitute 
the medium must be physically small enough and magnetically sufficiently independent of one another 
to permit short wavelength recording, and to give a high signal-to-noise ratio ( SNR ). There are two 
types of magnetic media which satisfy these requirements; particulate dispersions and thin metallic 


films. We shall be concentrating mainly on particulate recording media in this thesis. 


In the specific application of interest to us, namely, longitudinal digital magnetic recording, the 
magnetic medium is completely saturated parallel or antiparallel to the track direction. The most 
popular scheme for storing binary digits on the medium is the NRZI ( non-return-to-zero-interleaved ) 
code in which a binary "1" is represented by a transition in the direction of magnetization of the 
medium in a prespecified bit period, and a "0" is represented by the absence of such a transition . Fig. 
2-1 shows a typical magnetization waveform that is obtained by using this code. The details of the 
writing process, i.e. process of writing transitions on the medium, are not directly relevant to us in the 
analysis of the readback channel, and hence not discussed in this report. A reasonably detailed analysis 
of the writing process is available elsewhere [20]. It is shown that the written transition can be 
approximated by an arctangent function. We shall use this arctangent transition model as the starting 


point for our analysis. 


© 


. =ewa © @& @ @ ww 


eeeeee oem «o eee ofp eo om @ 60 


Figure 2-1: Magnetization waveform obtained by employing the NRZI coding scheme 


2.2. Frequency Response of a Karlqvist Head 


Fig. 2-2 shows the head-medium configuration of a typical digital magnetic recording system in which 
an inductive head with gap thickness g, is used for readback. The medium shown is a particulate 
recording medium with thickness 6 and width w. A particulate recording medium is a dispersion of 
magnetic particles in a binder. Each of these particles can be considered as a tiny bar magnet oriented in 
approximately the same direction as the track. The head moves with respect to the medium at a 
constant velocity in the X-direction; and hence, in any analysis, we consider two coordinate systems, one 
fixed to the medium and the other fixed to the head. If (zy, 2, ) are the coordinates of any point 
in the medium with respect to the medium coordinate system, then the corresponding coordinates, 
( Ta Va» 2 ), with respect to the head coordinate system are given by 


z= 2-72, 


¥y ™ YO, 


== 2 


z, ae (2.1) 
where z = vt; v being the velocity of the head with respect to the medium. 


To find the flux that links the read head when a certain magnetization pattern is written on the 


medium, we make use of a powerful result which comes from an application of the Reciprocity Theorem 


n turn head 


(xn Yi 2; ) 
4 Xm 


Zm 


Figure 2-2: Head-medium Configuration 


[20]. Let T(z, ,y,,s,) be the fringing field produced by the read head when it is excited by a current 
which results in a deep gap flux of unity. Then, the by Reciprocity Theorem, the flux linking the the 
read head due to the magnetization M(z,.) n’?m) of the media is given by’, 


é+6 
¢(z) = 4x I. an [ - P [. ere a a . H(z,,¥,)7,) dz, dy, dz, 


w é+6 
= 4n [ i M(z,+2,9,,2,) ° H(z,,y,,#,) ¢z, dy, dz,» (2.2) 
—00 
where - denotes the dot product operation. 


In general, M has all three components M . M, and M , On the other hand, if we ignore end effects in 
the Z-direction, the head field H lies in the XY-plane, and it does not depend on z,. Hence, we can 
expand the dot product in Eq. (2.2) to get 


We shall be using c.g.s. units in this chapter 


w d+6 co | 
¢(z) = anf / [ { M _(z,+2,9,,2,) H(z,,9,) 
+ M (2 ,+2,9,2,) H(z.) }dz, dy, dz, ‘ (2.3) 


The readback voltage” , V (z) at the output of the read head is given in terms of ¢(z) as 


(a) ny sal | (2.4) 


where n is the number of turns in the readhead, and n is its efficiency. 


Recognizing the integration over z, in Eq. (2.3) as a convolution, we can rewrite it as 
w d+6 
¢(z) = 4 [ | [ [ M,(z,y,,2,) * H(—z,y,) ] dy, dz, 


w d+6 
tan ff CM feauye,) © Hom) yy dey, (2.5) 
where @ denotes convolution. 


From the above equation it is evident that the response of the readhead can be considered as the output 


of a linear system to which the magnetization, M(z z,)) is an input. Taking the Fourier transform 


m’Ym’? 


of ¢(z) with respect to the space variable z, we get 
w d+6 - 
Hk) = Role) — an ff Meu.) "Chu, dy, de, 


w d+6 ; 
+ 4n [ : M y*yo?)) x, (k,y,) dy, dz, (2.6) 


where superscript © denotes complex conjugation, and 


or) sib 
X(ky,) = [ 2 H_(z,y,) e 7" dz , 


2We shall be writing the readback voltage and flux as a function of either the space variable z or the time variable ¢, depending 
on which is more convenient, keeping in mind that the z and ¢ are related as z = vt. 


jee) 
H (ky,) aan f H (z,y,) e** de ’ 
— 
M(kivyt,) = fo M,l2ivyz,) o* de, 


00 : 
My(Fivyt) = fo Maye) oF de (2.7) 
From Eq. (2.4), we can see that the Fourier transform of V(z) is given by 


V(k) = sknn Ok). (2.8) 


Using the Karlqvist approximation [9], the fringing field produced by the head, when the deep gap field 
is Hi, has components, H* and Hy, given by 


+ 9/2 z,— 9/2 
) — tan“) ], 


H, 
Hi (2,9,) = —| tan tan™*( st 


H (x caahcal 
Heyy) = - sf o[ > 


> 0. : 
: \s for y, 2 0 (2.9) 


(z, = 9/2)? ae uv, 
We mentioned earlier, when we applied the ixeciprocity Theorem, that the field H(z,.,y,) is the field 


produced by the read head when the deep gap flux is unity ,i.e., when H, g = 1. Hence, by substituting 
1/g for H : in Eq. (2.10), we get 
H(2,v,) — — [tan i Pasa 
ey) = 2 [aor atten: 
h’h ng Y, y, 
1 (z, + 9/2)" + vy," 


A (z,9,) = -s— ——___— |, for y, > O. (2.10) 
ie (z, — 9/2)? + y,” : 


27g 


It can be shown [20] that the Fourier transforms ( with respect to z, ) of the field functions in Eq. (2.10) 


are given by, 
ae g/2) en lk 
H (ky,) = pian X(ky,) for y, > 0. (2.11) 


10 


Using Eqs. (2.11) and (2.6), we can can calculate V(k) for any known M(z_,y_,z_) written on the 
medium. For particulate recording media, the magnetization is not a continuous function of z_ , y,,, and 
z,) because the magnetic particles are discrete. The average response of the head to the magnetization 
of all the particles that it scans at any instant is the same as its response to the average or bulk 
magnetization of the medium as a function of z, because of the linearity property of the convolution in 
Eq. (2.5). This average response shall be referred to as the signal in forthcoming discussions. Random 
variations of the response about the signal can be characterized if we know the response of the head due 


to the magnetization of the individual particles. We shall analyze these two cases in the ensuing sections. 


2.3. Bulk Magnetization Response 


The bulk magnetization of the medium is in the direction of the track, i.e., in the X-direction, for 
Zz) and M (z 


arctangent transition written at z,, = 0, the bulk magnetization is given by 


longitudinal recording. Therefore, M,{z 


m'Ym z) are zero. For a single positive going 


mm 


2M 


a 
é a | m 
M,{z,) n’=m) ae a tan (—) ’ (2.12) 


where M 7, 8 the saturation value of the magnetization in the medium, and a is the transition width 


parameter. 


Computing the Fourier tranform of the arctangent magnetization is not straightforward since the 


integral of tan“ (z__/ a)| over the interval (—0o,00) is not convergent. If we write 
tan~'(z_/a) = #/2 — tan "(a/z_) ; | (2.13) 


then it can be argued that (20) the Fourier transform of M , With respect to zis given by 


2M. 
M,(k) = 2M, nik) + 5—— ew lle (2.14) 


The delta function in the above equation is misleading because it attributes a d.c. value to the 
magnetization which, as defined in Eq. (2.12), does not have any d.c. value. We shall consider the delta 
function as representing the divergence of the arctangent function. Since the voltage is proportional to 
the derivative of the flux, this introduces a factor of k, which eliminates the delta function completely as 
we shall see shortly. 


Substituting Eqs. (2.11) and Eq. (2.14) in Eq. (2.6), we get 


ll 


n(k 2M, d+6 
Ok) = 4nw cae [ 2é,x Hk) — 5 eH | | ell, dy, 


sin(kg/2) _ 1—e7|Fl =u 2 
SAD ome EE Lamas seeem] am 


= 4rw 
Finally, using Eq. (2.8), 


sin (ko/2) tga Ire EP ia 


V(k) = 8rnnw (5/2) € rs M,. (2.16) 


The following terms can be identified in the above equations; 


Writing Process Loss : 7 /#l@ 


Gap Loss : sc del (+9/2) 
7 (9/2) 
Spacing Loss > eT lke 
1—e7 HIE 
Thickness Loss 


[-| 


All the loss terms tend to suppress the high frequency content of the magnetization waveform so that 
the flux waveform, for a transition written at z= 9, is also a transition at z = 0 but with a larger 


transition width. In fact, if we assume that the transition in the flux is also approximately arctangent in 


shape, i.e., 


2%, os 
o(z) = — tan'| : ); (2.17) 
where ¢ is the saturation value of the flux, then it can be shown [20] that ¢ and a are related by 
] ETAT REELED ENZEEEDELEEL TL TILE 
=. Vo? + 4(d+a)(d+a+6) . (2.18) 


Note that, as expected, ¢ = a for d = 6 = g = 0, i.e., when the recording is distortion-free. 
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2.4. Signal Model 


We can rewrite Eq. (2.17) in terms of the time variable t, by replacing z in the equation by vt. We shall 
also include terms n 7 in the flux expression, so that the readback voltage would be simply the time- 
derivative of the flux written in this way. Hence, 
2¢ 
m _1, ¢ 
t) = ——tan “(—), 2.19 
ot) = — tan“ =) (2.19) 
where o = ¢/v, and ¢,= 10 ¢,. 


The readback voltage as a function of time for a single positive going arctangent transition can now be 


written as 
d 26m, 
e(t) = 77 ¢(t) = py [—] : (2.20) 


1+ t?/o? 
The above model, which can be identified as a Lorentzian pulse with width parameter o, will be used as 
the model for the stgnal pulse in ensuing discussions. Note that in digital magnetic recording, we can 


have both positive and negative going transitions. Both these represent a digital] "1". Hence, the signal 
pulse for digital "1° could be either positive going or negative going. 


2.5. Response Due to a Single Particle 


Let us now consider the response of the head due to a single particle ( sth ) which is represented by a 
bar magnet m of finite length / and infinitesimal cross-section, located at coordinates (z., y., z,) in the 
medium with orientation ( 6, y ) as shown in Fig. 2-2. 


The particle makes an angle 6 with the X-axis. The angle 6 can take on values in the range (0,7].The 
projection of the particle on the YZ-plane makes an angle of » with the Y-axis. The angle can take on 
values in the range [0,27]. If the magnitude of the dipole moment of the particle is Mo) then the 


components of m in the three directions are given by 
m_ = mM, Cos 6, 
m= mM, sin 8 cos p : 


m, = m,sin$siny. (2.21) 


The magnetization of the medium represents the dipole moment per unit volume of the medium. If we 
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consider the medium to be made up of only the ¢th particle, then we can write an expression for — 


M(z_1Ymn%m) 85 


m 
M(z_Yn%m) = 7 d{y_ —y, — (z,,—2,) tan 8 cos p} (2.22) 
x d{z, — z,— (y,—y,) tan} [ u{z, —z,} — u{z, (cos 6 + 2 )}], 


where u{} denotes the unit step function. 


The above expression for the magnetization is only a mathematica] way of representing the fact that any 
volume integral of M(z_sYen?Zm) containing the sth particle can be reduced to a line integral along the 
direction of orientation of the particle. If « denotes the distance parameter along the particle direction, 
then the following constraints hold for the line integral. 

z, = 8 cos + z,, 

y, = esin@ cosy + Ye» 


z, = ssindsiny + z., 


0o<s < l. (2.23) 


From Eq. (2.2) we get that the flux linking the read head due to magnetization of the i-th particle is 
given by 


$() = anf 


d+5 oo 
/- [ [ = M(z__1Yn?2m) . H(z, ,9,>2,) dz, dy, dz, 


of 
w d+6 rove) 
= 47 [ [ [ a M(z_1Ymn%m) : H(z,—z, mn™m) 22m Wn 22mm ° (2.24) 


Again, using the Karlqvist head fields 


w d+6 oo 
o{z) = 4x [ [ [ wt tenn?) A gl2 92 Yen) 
+M AC mY) H(2,,—-2)Y yn) dz dy dz. 


4n f! | 
a = I. { M)cOos 6 HL & cos O+z —z, 8 sin Ocos vty, ) 
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+ m,sin 6cos ¥ HY e cos §+z —z2, 6 sin bcos p+y, ) } ds, (2.25) 
where H, and H, are given by Eq. (2.10). 


Using the shifting property of the Fourier transform we get 


4n f' 7 
${k) = 7. [.. { m,cos 6 H_°( k, ¢ sin bcos p+y, ) 


+ m,sin 6cos x *( k, ¢ sin 6cos p+y. ) } eFhl2 cos 6) + jkr; ay (2.26) 


where H_ and H y are given by Eq. (2.11). Hence, 


4 in (k 
${k) = — my et; eo lKly; aaa { cos 6 — j sgn(k) sin @cos yp } 


| elle sin 6cos elke cos 8 7, 
e=0 


in (kq/2 —|klisin cos p jkicos 6 __ 1 
= arm, oF; elt, HE (k/2) (+) 
0 kg/2 l 
cos § — 3 sgn(k) sin @cos w 
—|k|sin 6cos yp + skcos 6 
eo [Alisin Gcos  pjklcos 6 __ 1 


in (kg/2 
= Anim, oi ely, elie) [ l=: (2.27) 


kg/2 


Hence, the Fourier transform of the read back voltage due to the magnetization of the s-th particle is 


given by, 
: —|blisin 6coa y ,ykicos@ _ y 
= oh ee 
VAk) 4xnnm, e*7; ¢ g/2 ; ; (2.28) 


This is the response due to a single particle. However, during readback, the readhead is influenced by 
the magnetization of a very large number of individual particles, which could have arbitrary 
orientations, positions and sizes. The response of the read head can hence be considered as a random 
process whose mean value is the bulk response which we derived earlier. Variations of the response 
about this mean value can be modeled as an additive noise. In the next chapter, we shall see how the 


above expression for Vk) can be used to calculate the average power spectrum of this noise. 
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2.6. Summary 


In this chapter we provided a fairly detailed analysis of the readback process with the objective of 
studying the frequency response of the channel. We first considered the response of the channel to the 
bulk magnetization of the channel and showed that the channel behaves as a cascade of loss factors, 
namely, gap loss, spacing loss and distance loss. We then considered the response of the channel due to 
a single particle in the medium. This will be used in the next chapter to calculate the average power 


spectrum of media noise. 
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Chapter 3 


Average Power Spectrum of Media Noise 


3.1. Introduction 


In Chapter 2, we mentioned that the fact that particulate recording media are made up of discrete 
magnetic particles causes random deviations about the bulk or average response of the readback 
channel. In order to quantify these deviations, which we shall refer to as media notse, we need to 
characterize the noise stochastically. One of the ways to characterize the noise partially is through the 
use of an average power spectral density description. We emphasize the word partially, because, as we 
shall show in the next chapter, average power spectral densities yield a complete characterization only 
when the noise has the special property of stationarity, i.e., the statistics of the noise are insensitive to 
time shifts. It has been well established that particulate media noise is signal dependent and hence does 
not possess the property of stationarity. The advantage of using a power spectrum description, however, 
is that the power spectrum of the noise can be easily measured using a spectrum analyzer. Comparison 
of experimentally observed average power spectra with those obtained from theoretical modeling can 


yield a good insight into the nature of the noise. 


Most of the existing models for the average power spectrum of the noise use the frequency response of 
the head due to an individual particle as a starting point, and compute statistical averages of this 
response to obtain the desired average power spectrum. One of the first papers that discusses such an 
approach is by Mallinson [13]. He assumes that all particles are identical and have their orientations 
perfectly aligned with track direction, so that the only randomness is in the eign of their magnetic 
moments. Thurlings [18] has done a similar analysis without the assumption of perfect alignment of 
particles and with clustering of the particles taken into account. Anzaloni and Barbosa [1] have extended 
Thurlings’ analysis in an attempt to explain the dependence of the noise power spectrum on signal 
frequency. Tarumi and Noro [17] have identified another contribution to media noise, namely, surface 


asperities which cause random fluctuations in the head-to-medium distance. In this chapter we shall use 
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ideas from all these papers to arrive at a model for the average power spectrum of particulate media 


noise. 


3.2. A Simple Model for Particle Interactions 


A particulate magnetic medium is a dispersion of elementary magnetic particles in a binder; the 
position, shape, and orientation of the particles are random variables whose statistics are governed by 
the manufacturing process. In the dispersion process, the particles do not distribute themselves 
uniformly and independently; being magnetic, they interact with each other to form agglomerations. 
The magnetic interaction of the particles in the medium is a complicated matter, requiring the 


simultaneous consideration of all the particles in the medium. 


head 


clusters 
medium 


=> 
Za 


subclusters 


SS === Cc» equivalent particles 


at high writing frequencies. 


Figure 3-1: A simple model for particle interactions in terms of clusters and subclusters 
In order to arrive at a simple model, we assume that the interactions are either very strong or weak. 


Fig. 9-1 is an attempt at representing these two types of interactions pictorially. Very strong 


interactions cause the formation of eubclusters, which are sets of particles strongly interacting with each 
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other and weakly interacting with all other particles not belonging to the same subcluster. Since the sign 
of the magnetic moment of the particles also depends on the applied external field, the final state of 
magnetization of the particle depends on the applied field and on the attributes of other members of its 
subcluster. For a slowly varying applied field, the magnetic moments of all the particles of a subcluster 
are forced to have the same sign. However, when the signal frequency is high, strongly interacting 
particles in a subcluster may end up with opposite polarization, canceling each other and thus reducing 
the equivalent size and magnetic moment of the subcluster. Under these conditions we can consider the 


subcluster to be an elementary particle whose size decreases as the signal frequency increases. 


Weak interactions between subclusters, during the manufacturing process, cause them to form clusters. 
The horizontal position of a subcluster within a cluster depends on other subclusters within its cluster, 
but is independent of the subclusters not belonging to its cluster. We model the cth cluster by picking an 
integer N from a given distribution and putting together N. subclusters. We shall assume that the 
horizontal positions u of the centers of the clusters are uniformly distributed along the track. The 
horizontal position w of the subclusters with respect to their respective cluster centers is drawn from a 


distribution with probability density function p(w). Hence, the horizontal position z of the subclusters in 
the medium is given by 


z=utw. (3.1) 


The ;-robability density function of z can be shown to be [14] the convolution of the density function of 
u and the density function of w. The density function of u is uniform over the entire length of the tape 
whereas the density function of w, p(w), is limited to the region of the cluster. The convolution of the 
density function of u with that of w is hence also approximately uniform if we ignore the distortions at 
the medium boundaries. Hence, the distribution of the horizontal positions z is approximately uniform. 


Similarly, the distribution of the vertical positions y of the particles is also uniform. 


3.3. A Model for the Average Power Spectrum of the Readback Voltage 


The readback voltage can be considered as a random process r(t) whose mean value is the desired signal 
a(t). The noise in the readback voltage n(t) is defined to be any random deviation from the signal so 
that, by definition, n(t) is zero-mean. Hence, if we model the average power spectrum of r(t), we can 
obtain a model for the average power spectrum of the noise from this by subtracting the average power 
spectrum of s(t). The average power spectrum of r(t) is defined by [14] 
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3 im Se E( | | elt) oH at [? 
iw) = Boel ot | [1 ec dt |*}. (3.2) 


Eq. (3.2) is called the time average definition of the average power spectrum of a random process. In the 
next chapter we shall be defining the average power spectrum in a different way in terms of the Fourier 
transform of an average autocorrelation function. The reader may be familiar with either of these 
definitions. We shall show in the next chapter that the two definitions are equivalent. We can rewrite 


Eg. (3.2) in terms of space variables, using z=vt and w=kv, as 


| 7 & 
Bk) = lim SFI [ nha) e™ dz} (3.3) 


In the limit as T -+ oo, we can replace the integral in Eq. (3.3) by the Fourier transform of r{z) to get 


S(k) = lim — k) |? : 
(b) = tim ECL olh) PY, (3.4 
where p(k) is the Fourier transform of a sample realization of the random process r(z). 


If N, is the total number of particles ( subclusters) in the volume of the medium from —vT to uT, then 


N. 


re 
ak) = D> Vik), (3.5) 


t=] 


where V{k) is the Fourier transform of the voltage contribution due to one subcluster as given in Eq. 
(2.28). 


We can rewrite Eq. (2.28) as 
Vik) = 6, e*i Hi{ky,m,,0,¥,) , (3.6) 


where 6. is the sign of the magnetic moment of the particle, @ is now restricted toO < 9 < 1/2 and H, 


is given by 
: —|klisin @cos p ,jkicos@ 
_ —|nly, #0 (Fo /2) E 3.7 
H, 4nngnm,e "i kg/2 ; , (3.7) 
Therefore 
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N, 
T 
1 
. * 2 
5(k) = lim oF 2 8, H, ei |? } 


N,N, 


| rt Nr 
== lim HY 4H, HS ez ,-2)) } 
T— 0 i=l t=) 
Nr Ny Nr 
= lim sal © EHP} + Bl e6 HH MY). (8.8) 
ies fer] lx=1l xs 


Let the N,subclusters be distributed among clusters with N, subclusters in cluster c. With the exception 
of z. and é., we assume that all the random variables associated with a single subcluster are statistically 


independent. Furthermore, if we assume that the subclusters are statistically identical, then we can 


write the above equation as 


Ny Nr : 
5 (k) = lim = —| Bi) N, + Bi) > Atk) | , | (3.9) 
fez] fax} l yf ¢ 
where 
Bk) = E{|H,)}, 
Bik) = | EC H,;}/, 
A(k) = E{ 6, 0, *i-*)) } . (3.10) 


The sign of the magnetic moment of a subcluster will "follow" the magnetization written on the disc. In 
other words, the probability of the sign being positive is larger at places where the written signal is 
positive and vice versa. One of the ways of describing this dependence is through the use of the following 


joint probability density function for the random variables s and z. 


jls,z) = ; —| {1 + i &se—-1) + {1—- 7 a(6+1) | (3.11) 


Using the above joint density function, we compute A(k) for the following two cases. 


Case 1: 


The subclusters ¢ and / are located in different clusters; i.e., the random variables z, and z, are 


independent. Then we can write 
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A(k) = E{ee**i} E{ee**1} = | E{s e**} |, | (3.12) 


where 


: 1 vT M(z) . 1 vT M(z) : 
Ee) = ef ata eo eee) ee dz 


= M. (in the limit as T-+ 00 ). (3.13) 
Therefore 


1 M(H)? | 
A(k) = 7 vr (3.14) 


The total number of such contributions to the double summation in Eq. (3.9) is given by 


DD LL Ng: 


¢ de 
Next, we consider the case when the subclusters ¢ and / are located in the same cluster. 


Case 2: 


The subclusters ¢ and / are located in the same cluster; i.e., 

z= ut, and z= u t+ wy. 
Now, the random variables z, and z, are not statistically independent but u, w, and w, are independent. 
Hence we can write the joint probability density function of 8, 8, U, wW, and w, in the following way. 


J (8,6,u,w.w,) = f(¢,w,/u) f (¢,w,/u) f (u) 


1 P(w,) Mu+w,) M(u+w,) 
= or 2 1+ —M } &s —1) fhe  M, } as, +1)| 
p(w,) Mu+w,) M(u+w,) 
x —|k +z) Aer) + {1 — aq) Aera)]. (3.15) 


Therefore 
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Ak) = E{ ee, ei“) } 


1 M(u+w,) M(u+w,) ee ae 
— sah [Lf eedeuy—y— a Mr? a, dy, a 


1 [/ M(u+w,+z) M(u+w)) aes - 
=o. ha / p(w,+z) rv) a M, z Y u. (3.16) 


Now, in the limit as T— 00 


[ p(w,+z) p(w,) dw, = p(z)*p(—z) = F(z) ; (3.17) 
af 


and 


/ Mut+w,t+z) M(u+w) du = M(z)eM(-z) = R,f2), (3.18) 


where F(z) and R,{z) represent the autocorrelation functions of the deterministic functions p(z) and 


M(z) respectively. 


Hence, 


A(k) = 


seg hi FF az 


— ea [ F{ Rlz) oF { Ryfz)} J). (3.19) 


But from Eqs. (3.17) and (3.18) we get 


F{R(2)} = | Arlw)} ? = Sk), 


F{R,f2)} = | AM(z)} |? = | M(a)/. (3.20) 
Therefore 
M (k) |? 
A(k) = 7 5,(h) « a (3.21) 
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The total number of such contributions to the double summation of Eq. (3.9) is 
>. N(N-})- (3.22) 
c 


The total average power spectrum of r(t) is obtained by summing all the contributions of the type given 


in (3.14) and of the type given in Eq. (3.21), and substituting in Eq. (3.4). We get 


M (k) |? 
5k) = tim . sal BOD N, + Bik) rae N,N, 
c exe 
M (k) |? 
5 Bk) —| S i(k) « a] > N, (N,-1) | (3.23) 


Now we shall try and quantify the statistics of the clustering process. We shall assume that the 


probability of having exactly N clusters in length L ( = 2vT ) of the medium is Poisson distributed with 


Poisson parameter a, i.e., 


ANL) = ae (3.24) 
and 
E{N} =Var{N} = aL. (3.25) 


Hence we get the following. 


N 
aD N,} = E{ > N,N} =vaN, 


~ 1) 
N,Nj} = E{ ———WN?} = a? N? 
are: c=] a! J 
apes N——- ces 
ED N(N-1)} = E{ +N, (N-1)} = aN (N-1), (3.26) 
cx] 


’ 


where the overbars indicate average values of the quantities below. 


The term 
l 
? Ee k 2 3.27 
ae Sor! M (k) | (3.27) 
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represents the average power spectrum of the magnetization written on the medium, which we shall 


denote by 5 M+): 


Substituting Eq. (3.27) and Eq. (3.26) in Eq. (3.23), we get 


| SA) et) 
5k) = a NB lk) + a Bk) N(N-1) | st S,(k) | + 0? B,(k) Re (328) 


The third term in the RHS of Eq. (3.28) is the average power spectrum of the stgnal. The relationship 
between this and the bulk magnetization response derived in Chapter 2 is not immediately obvious, but 
will become more apparent when the expression for B,(k) is written out. The first term in the RHS of 
Eq. (3.28) is the signal independent background noise. The second term, which arises due the 
phenomenon of clustering, is the signal dependent modulatton noise. We can analyze these terms in 


more detail if we find B)(k) and B,(k) in terms of the statistics of the random variables associated with 


the particle. 


Since we have assumed a uniform distribution for the vertical position y, of a subcluster, we get 


d+6 -— lly. as miley 
—|kly, pod ae — —|kid (i-e™) 
E{ e ©; } : ; dy e me 


Ce ay = ertte 


5 rr 


E{ eH; x [ = (3.29) 


Also, we can simplify the expression for H, in the following way. 


eo lalisin Ocos $ erbicos @ oe, | : ( 7 [Alisin bcos p cos {kicos s}—1 )? 


l (2 


( e7 lklisin cos » 55 {klcos 6} )? 
[? 


7 2lalisin Gcosp _ D¢7 lelisin bcos ¥ O65 {klcos é}+1 | 
a, (3.30) 
[? 
With the assumption that [k| < 1/I ; i-e., for frequencies much less than the particulate limit, we get 
e~2lklisin bcos oy — QI kllsin Pcosy + k? I? sin 26 cos *y 


Qe lelisin fcos¥ gy 9( 1 — [k|l sin @ cosy ) 
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k*I? cos 26 
cos( kl cos§) es 1 — aa aa (3.31) 


Therefore 


[Aisin bcos p pykicos 8 __ 
; |? ss k*[cos2@ + sin76 cos*y]. (3.32) 


Using the above approximation and Eq. (3.29), we can now write expressions for B,(k) and B,(k) from 
Eqs. (3.10) and (3.7) as 


_ sin (kg/2) 2 ,2 —2lkld (1—e~ 2118) 
Blk) = Ay | kg/2 fee Be” 
sin (kg/2) _ (1—e~|#I5) 
Bk) = A, ens lh k? ¢~2lkid hom i (3.33) 
where 
A, = (4x09 is m,° E{ cos 76 + sin 76 cos 74) VG 
A, = (4rnn)? my” [ E{ Vcos 76 + sin 76 cosy } ]?. (3.34) 


We observe that B,(k) has all the loss terms that we derived in Chapter 2 when we analyzed the signal. 
The term B,(k) has nearly the same functional dependence on k indicating that the background noise 
has a power spectrum which is very nearly equal to the channel response. This means that the 
background noise can be modeled as a white noise source which is added to the magnetization M(z) and 


it gets modified by the readback channel in the same way as M(z). 


We also observe that B.(k) and B,(k) depend on the value of the magnetic moment m, of the subcluster. 
As we mentioned earlier, m, ieceeanes with increasing writing frequency, and, hence, B,(k) and Bk) 
are scaled down by the same amount with increasing writing frequency. This means that al] the three 
terms in the average power spectrum of the readback voltage get scaled down by the same amount with 
increasing writing frequency; but the signal-to-noise ratio remains unaffected. We shall make use of this 


fact when we model the noise in the time domain in the next chapter. 
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3.4. Noise Average Power Spectrum 


From Eq. (3.28) we can get the average power spectrum of media noise 5 (k) by subtracting the signal 


power spectrum, i.e., 


S Ak) 


M? 


3 (k) = ao NBAk) + @ Bk) N, (N-1) [ ; s(x) |. (3.35) 
As mentioned earlier there are two components in the noise power spectrum, background noise and 


modulation noise. We shall now discuss both these terms in more detail. 


3.4.1. Background Noise 


The background noise as the name suggests is present even when there is no recorded signal. This noise 
can be best described as the output of the read head when the medium ks been a.c. erased to ensure 
that the bulk remanence in the medium is zero. Background noise arises purely from the fact that the 
medium is made up of discrete particles, and would be present even if there was no clustering. As 
mentioned earlier, this noise term can be considered as white noise which is "colored" by the readback 
channel. This is the type of noise that is encountered in most communication channels and a host of 


communication theory results have been developed for signal detection and estimation in the presence of 


such noise. 


3.4.2. Modulation Noise 


This noise term arises as a consequence of the clustering phenomenon. In fact, the term that 
“modulates" the magnetization in Eq. (3.35) is directly related to the probability density function of the 
position of a subcluster within a cluster. The simplest way to measure modulation noise is to record a 
periodic magnetization pattern on the disc and to observe the average power spectrum of the readback 
voltage around the fundamental frequency and harmonics of the periodic pattern. The modulation noise 


appears as “shoulders” around these frequencies. 


It should be mentioned here that particle clustering is not the only reason for a modulation noise term. 
Tarumi and Noro [17] have shown that asperities on the surface of the medium also cause a modulation 
noise which has much smaller bandwidth than the modulation noise produced by clustering. In the next 


chapter we shall attempt to obtain a general time-domain model which could be made to include both 


these modulation noise terms. 
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3.5. Summary 


In this chapter we derived an expression for the average power spectrum of the readback voltage based 
on a simple model for particle interactions within the particulate recording medium. This simple model 
for particle interactions classified the interactions to be either very strong or weak. Weak interactions 
cause the formation of clusters, and very strong interactions produce subclusters. We showed that 
clustering of particles causes a modulation noise term which is signal dependent. Subclustering causes 
the two noise terms and the signal to depend on the writing frequency. We noted that even though both 
signal and noise depend on the writing frequency, the signal-to-noise ratio is independent of the writing 


frequency. 
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Chapter 4 


Time-domain Model for Media Noise 


4.1. Introduction 


In Chapter 3 we established that the noise from particulate recording media consists of a signal 
independent background noise term and a signal dependent modulation noise term. We made an attempt 
at obtaining an exact formulation for the average power spectrum of the noise in terms of the various 
parameters, both random and deterministic, associated with the recording system under consideration. 
In the past such a characterization of the noise has been mainly used to calculate, in a very restricted 
sense, the signal-to-noise ratios in these systems [13]; very little attempt has been made to characterize 
the noise for the purposes of designing bit detection schemes in the presence of this noise. Our goal is to 
evolve criteria for designing bit detection schemes based on the statistics of the noise. Hence, we need to 


characterize the noise completely in the stochastic sense. 


Media noise, observed as a function of time ¢ at the output of the read head, can be considered as a 
sample realization of a governing stochastic process. Typical media have about 1014 particles per cm® so 
that the read head scans some 10° particles every instant [12]. Since the noise is the combined effect of 
the randomness in the location and orientation of such a large number of similar particles, by the 
Central Limit Theorem [14], it is reasonable to assume that the governing process is Gaussian. This fact 
has been further corroborated by experimental work done by Filar and Wright [8]. Recent work by 
Barbosa and Anzaloni [2] demonstrates that we do not need to make the Gaussian assumption since the 
N-th order probability density function of particulate media noise can be obtained from some simple 
particle interaction models. However, the Gaussian assumption makes stochastic analysis tractable 
because Gaussian random variables have very useful properties [14]. A host of communication theory 
results have been derived using this assumption, and applied in many practical cases for which the 
statistics is far from Gaussian. Since in our case we have strong reasons to support the fact that the 


noise is Gaussian, we are more than justified in making this assumption. 
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A Gaussian noise process n(t) is completely characterized stochastically by its mean function n(t) and its 


autocorrelation function R,(t,t+7), defined by , 

n(t) = E{n(t)}, (4.1) 
and 

Ri(tt+r) = E{ Ln(t)— oft) ] Enlt+s) — alt+s}, (4.2) 
where E{.} denotes expected value. 


The noise process is said to be wide sense stationary if n(t) is constant for all t, and F,(t,t+7) does not 
depend on ¢, ie. , n(f) =, and Ff (t,t+7) = R(r). It should be noted that for a Gaussian process, 
wide sense stationarity implies stationarity in a strict sense [14], i.e., the N-th order statistics of the 
process depends only on time differences. Hence, we shall be using the terms “stationarity" and "wide 


sense stationarity” equivalently. The Fourier transform of F (7), denoted by S (w), is called the power 
spectral density of the stationary stochastic process. 


For a non-stationary Gaussian process with constant mean, an example of which is media noise which 


has zero mean, the autocorrelation function R, (t,t+7) is not independent of t. In this case, we define the 


average autocorrelation function of the non-stationary process as [14] 


_ 1 ft 
R (7) oe a. T [ a R(t,t+7) dt . (4.3) 


The Fourier transform of R (7), denoted by 5, (), is called the average power spectral density of the 
process’ . We can obtain R (1) from measurements of 5 (v) using the inverse Fourier transform. 


However, in general, measurements of 5) cannot be used to obtain the autocorrelation function 
R,(t,t+7) which is needed to completely characterize a non-stationary Gaussian stochastic process. 


Hence, the adequacy of the average power spectrum derived in Chapter 3 in characterizing media noise 


is questionable. 


Here, a time-domain model for the noise, which expresses the non-stationary noise process in terms of 
two stationary stochastic processes and a deterministic function of time, is presented. The deterministic 


function of time depends on the signal and it reflects the signal dependent nature of the media noise. 


SS 0e the Appendix section at the end of this chapter for details 
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The autocorrelation functions of each of the component stationary stochastic processes are determined 
from a set of spectrum analyzer measurements. These are then used to determine the autocorrelation 


function of the non-stationary media noise when a general deterministic signal is written on the medium. 


4.2. Time-domain model for media noise 


We model the non-stationary media noise n(t), as seen at the output of a readback channel in which an 


inductive head is used, as follows : 
d 
n(t) = X{ nt) a) ) + nt), (4.4) 
where ¢(t) is related to the readback signal s(t) as 


s(t) = aera (4.5) 


The function ¢(t) is directly proportional to the net flux linking the read head, unless equalization is 
used in the readback channel. The terms n,(t) and n,(t) are assumed to be zero-mean stationary 
Gaussian stochastic processes. We also assume that n(t) and n,(t) are independent, so that expected 
values of the cross products of n,(t) and n,(t) are zero. With these assumptions, if R, and FR, (n) are 
respectively the autocorrelation functions of n,(t) and n,(t), we can express the autocorrelation function 


FR, (t,t+7) of n(t) in terms of these as follows. 


Ri(t,t+r) = E{ n(t+7) n(t) } 
= E{ [n,(t+7) —o(t+1) + ¢(t+7) <n,(t+7) + n(t+r) ] 
[n) <4) + 6) Salt) + a(t) 
d d d d 
= E{n,(t+7) n,(¢) } a fltt7) rau + E{ n(t) ailtt) }o(t+1) 5,0) 


d 
+ E{ n,(t+7) =n,(t) } (2) Folt+2) + Ef <n,(t+2) LU } o(t+7) o(t) 


+ E{ n(t+7) n,(t) } . (4.6) 


Now, since n,(t) and n,(t) are stationary processes 


E{ n(t+1) nt) } = R, (9), 
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Also from results given in {14] for derivates of stationary processes, we have the following. 


d 
B{ ni(t+9) 0) } = RA) = - SRO, 


d d d 
E{ alt+s) n(t)} = Ryn Ao?) a? R, (-7) oT ae R, (7) 
dd d a’ | 
Ei on (9 on (t+7) } = Tn, in A) = oan (9 - (4.8) 


Substituting Eqs. (4.7) and (4.8) in Eq. (4.6), and using Eq. (4.5) we get 
R d d p d as 
za(ttte) = Fale+e) SHO) R, () + ote) TA) TR, 9 


— ¢(t Sie op — g(t t © oR R 
(t) Fo(t+7) — a (7) ¢(t+7) ot) 2 n 7) + a7) 
= a(t+r) 6(t) R, (9 + $(t+r) o(t) R, 
— $(t) o(t+7) R, (0) — ¢(t+r) o(t) RM) + R, (7), (4.9) 


where superscript ’ stands for differentiation with respect to 7. 


The average autocorrelation of n(t) can be obtained from F, (t,t+1) as follows. 
7 1 7/2 
(r) = lm =| R (t,t+r) dt 
lt Tc TJ-7/2 yer 
1 7/2 
= FR (r) lim = [ e(t) s(t+r) dt 
wi) tim af, a e+e 
+ <r (1) i =f o(t+r) o(t) at 
—FR (r) lim - t) 8 
dr " T-co I —T/2 


Se ian) at 
- $F,(0) tim = [ nya HE) at 


d° . 2 {7 4.10 
~ = R, (7) im 7 [ -- ¢(t) d(t+r) dt + R, (0) . (4.10) 


rs gl) denotes the average power spectrum of ¢(t), then we get the following equations in the limit as 
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rs g) denotes the average power spectrum of ¢(t), then we get the following equations in the limit as 


T tends to infinity. Here ¥ {} denotes the Fourier transform with respect to r of the quantity within the 
brackets. 


F{ lim fot) o(t+2) dt} = tim Leu) o"Qw)— tim 2 e = 5 
( Hin fig M0) M42) dt} — Tim 7.844) (0) = Him 7] Hu) P= Fo), 


rim tf oH etenat) = tim LE ¢° 
( Hina fg OO) alte) at) = im 5 Ls Ou] OC) 


T -+ 00 
1 
= li ear Oo ee is 
oe ze | Pw) | go 5 (wv) , 


x( lim =f d lim 7 
( Hin fg MHD Ue) ae) — im 5 Mu) Lie He] 


T - 0 


= lim 5-56) 1 He) F = - HT YW), 


T/2 
F { a = [ as e(t) e(t+r) dt} =— Lae = Lie F(w)] [xu O(w)]” (4.11) 


a ee 2 2 
= gen T | Hw) | a ae 5) ’ 


Let S_ (w) and S_ (w) denote respectively the power spectral densities of n,(t) and n,(t). Then we get 
1 0 


the following set of results from well known properties of Fourier transforms of derivatives. 


FLER (9) = iS, (0), 


d? ; 
F { 72 R, (7) } = -w 5, (+) : (4.12) 


From Egs. (4.10), (4.11) and (4.12), and by making use of the fact that the Fourier transform of a 
product of functions is the convolution of the Fourier transforms of the functions, we get an expression 


for the average power spectral density 5 (w) of n(t) as follows. 
Siw) = F( R(x) } 


= 5, )* [oS (0)] + 2LeS, ()] +13) 
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+ [wS (uw) ]* Siu) + S, w) 
1 0 


= we" W) Ww W). 
CS, (4) © 50) + 5, (w) (4.13) 


The transition from the second line to the third line in the above equation may not be obvious to the 


reader. We have provided the details of this transition in the Appendix section at the end of this 
chapter. 


We shall show in the next section that the average power spectral density of Eq. (4.13) is consistent with 
spectrum analyzer measurements of the noise power spectrum for specific signals. The agreement of Eq. 
(4.13) with experimentally observed power spectra coupled with a close look at the physics of the 
readback process provides a theoretical basis for the model in Eq. (4.4). 


If we compare the average power spectral density of Eq. (4.13) with the average power spectral density 


of the noise derived in Chapter 3 ( Eq. (3.35) ) we find that they are very similar. For the purposes of 
making a comparison we shall rewrite Eq. (3.35). 


sete Oe) 
3 (k) = a NB(k) + a Bk) N(N-1) es s,(k) |. (4.14) 


&é 


Comparing Eqs. (4.13) and (4.14), we see that the first term on the RHS of Eq. (4.13) represents 
modulation noise and the second term represents background noise. We note that the modulation noise 
term, i.e., the convolution of 5, (4) with 5 gl); arises due to the multiplication in the time-domain of a 
stationary noise process n,(t) with the flux ¢(?). In the ensuing discussion we shall show that the choice 


of this multiplicative noise model for modulation noise was not made arbitrarily. 


In Chapter 3 we saw that the phenomenon of clustering was one of the ways of explaining the 
modulation noise term in the noise average power spectrum. If we follow the steps taken to go from Eq. 
(4.4) to Eq. (4.13), it is easily seen that a time-domain ( or space-domain ) model for the noise which 
produces the convolution term 5 MA)? S AK) of Eq. (4.14) would involve the multiplication of the 
magnetization waveform with a stationary noise process which represents clustering. This multiplicative 
noise term then passes through the readback channel, represented by B,(k), and gets affected by the loss 
terms and the differentiation of the er head just as the signal does. 


Now, the model for modulation noise described in the above paragraph is more reasonable than the 
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model of Eq. (4.4), but obtaining the parameters of the model is going to be very difficult because we 
would need to calculate B,(k) which requires a characterization of the various random and deterministic 
variables that are associated with the recording system. Also, when bit detection is done, what is 
available at the input of the detector is the readback voltage and not the magnetization. A model which 
expresses the signal dependent noise in terms of easily available quantities is hence more useful in 


designing optima! bit detection schemes. 


The approximation that we have made in the model of Eq. (4.4) is that the signal dependent noise 
multiplies the flux and not the magnetization. The approximation is much better if equalization is used 
in the readback channel, because the flux waveform would then have approximately the same shape as 
the magnetization written on the medium [20]. The flux as defined in Eq. (4.5) is easily obtained from 
the readback signal by integration. The process of integration does not, however, yield the d.c. value of 
the flux. But this should not pose much of a problem because, as we shall see in the next chapter, the 
d.c. value of the flux is usually known beforehand. 


Pd 


4.3. Experimentation 


The functions 5, () and S,,() were obtained with an HP8568B spectrum analyzer on a recording 
system using a 750 Oe, 0.75 ym thick particulate disk and a MnZn-ferrite recording head with a 0.375 
pom gap length. 5, () was measured by amplifying the readback voltage from an AC erased disk. It is 
noted that this background noise also includes noise from disk imperfections, and non-media noise such 


as head noise and instrumentation noise. The resulting S, (w) is shown in Fig. 4-1. 
0 


The signa] dependent modulation noise term shows up as a convolution of the average power spectrum 


of the flux 5 ) with 5, () in Eq. (4.13). When a periodic signal is written on the disc, 5 g) consists 
of delta functions at the fundamental frequency and the harmonics of the signal. The convolution of 
S =) with these delta functions results in shifted versions of 5,,,() around these delta functions scaled 
by the average power in ¢(t) at the corresponding frequencies. Hence, to measure 5, () square wave 
signals were written on the disc at frequencies greater than the bandwidth of 5, (+). Saturation 
recording was used so that the flux ¢(f) in all these cases had the same amplitude. Fig. 4-2(a) shows a 
string of pulses measured at the eainst of the read head when a square wave is written on the disc at a 
frequency of 1 MHz. Fig. 4-2(b) shows the corresponding ¢(t) obtained after integration and d.c. 


correction. The saturation value of the flux can be obtained from this plot. 


Curves (a}-(c) in Fig. 4-3 show plots of these modulation noise power spectra measured in a bandwidth 


35 


of 1.0 MHz around the fundamental frequencies of three square waves, written at 500 kHz, 1.0 MHz, and 
1.5 MHz. Since the power contained in the fundamental frequency of ¢(t) in all these cases is the same, 
the difference in the modulation noise term w* { 5 gl) af 5%) } around the three center frequencies 
should be due to the multiplication by w*. In order to verify this, we performed a division by w” in all 


the plots. The resulting plots of 5 g) * S_ (w), which are simply scaled versions of S| (w), are shown in 
1 1 


Fig. 4-4, curves (a)-(c). As expected there is considerable similarity in these plots. 


It is well known that d.c. erased noise is larger than a.c. erased noise in particulate media [18]. This 
increase can be explained by our noise model as a modulation noise around w = 0. To verify this, we 
saturated the disc using a large d.c. erase field and measured the modulation noise power spectrum up to 
a frequency of 500 KHz ( see Fig. 4-3, curve (d) ). Fig. 4-4, curve (d) is a plot of the power spectrum 
after division by w*. The 3 dB difference between Fig. 4-4, curve (d) and Fig. 4-4, curves (a)-(c) can be 
explained by the fact that the d.c. modulation noise shoulder is one-sided. 


It should be noted that measurement of modulation noise cannot be performed independently of the 
background noise. Hence, to obtain the average power spectrum of the modulation noise alone, the 
background noise power spectrum has to be subtracted from the total measured power spectrum. Also, 
the signal spikes and the very narrow band modulation around them, which is due to fluctuations in disc 


velocity , must be suppressed. This subtraction was done to obtain all the modulation noise power 


spectra. 


We computed 5,,() from the data that is plotted in Fig. 4-4. The autocorrelation functions R, (9) and 
F,, (9 are computed from 5, (4) and S (4) using the inverse Fourier transform. Plots of F, (9) and 
R nl) are shown in Figs. 4-5(a) and 4-5(b), respectively. We observe that both R, () and R,, (7) look 
very much like sinc functions, indicating the band limited nature of these noise terms. We can see that 
R,, ) has a very small width in the time domain 

( = 0.1 y—sec ), indicating that it has a large bandwidth. The modulation noise term, however, has a 


much larger width in the time domain ( = 0.05 m-sec ), which indicates its small bandwidth in the 


frequency domain. 


From R, (r) and R, (7), we can obtain the desired autocorrelation function F(t,t+r) for any general 
1 0 
magnetization pattern written on the disc. In the next chapter we shall see how we can make use of this 


autocorrelation function in designing optimal bit detection schemes. 
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4.4. Discussion and Conclusions 


Most of the existing theoretical and experimental work regarding particulate media noise has dealt only 
with its average power spectrum. The emphasis in the past has been to come up with an exact 
formulation of the average power spectrum of the noise in terms of the various parameters associated 
with the recording system. The model that we have presented in this chapter has parameters which can 
be obtained from simple set of experiments, and it is general in the sense that it can be made to include 
any noise terms which are similar in nature to the background noise and modulation noise of the media 


noise. For example, the modulation noise due to clustering and that due to surface asperities can now be 


treated on the same footing. 


Since average power spectrum measurements cannot be used directly to obtain the autocorrelation 
function of particulate medix noise, Tang [16] suggested the use of time-domain measurements to obtain 
the desired autocorrelation function. These time-domain measurements are far more complicated than 
the spectrum analyzer measurements and are prone to timing errors. The main point made in this 
chapter is that we can in fact use spectrum analyzer measurements to obtain the autocorrelation 
function of media noise. Also, unlike Tang’s time-domain method which can only be used to determine 
the noise autocorrelation function for a specific signal, R, (7) and F,, (7) as presented in our model can 


be used to deduce the noise autocorrelation function for any general signal written on the disc. 


It should be noted that this simple noise model does not explain some of the observed features such as 
the decrease in total noise power with increase in writing frequency in case of particulate recording 
media. But as we mentioned in Chapter 3, both the signal as well as noise terms are affected in the same 
way when the writing frequency increases, maintaining the same signal-to-noise ratio. We shall see in the 
next chapter that in the design of optimal bit detection schemes, it is not the absolute values of the 
signal power and the noise power that are important but rather the ratio of these two. Hence, we are 
justified in using the model as a useful tool in designing optimal detection schemes for recorded digital 


signals in the presence of media noise. 


37 


4.5. Appendix 


4.5.1. Details of the derivation of Eq. (4.13) 


Consider the following example. 


The function 2{t) is obtained from two other functions z(t) and y(t) by taking the second derivative of 
the product of z(t) and y(t). We can express 2(?) as 


a? 
+t) = a2 { 2(t) u(t) } (4.15) 


By expanding the second derivative in the above equation, we get 


2(t) = [So] uo - 2[ <2) [= ua] rs «0 [v0] (4.16) 


Using well known properties of Fourier transforms, we can express the Fourier transform of z(t) in two 


ways. 


From Eq. (4.15), we get 
Zw) = — wu" {X(w) * Yu) }, (4.17) 


where Z(w), X(w) and Y(w) denote, respectively, the Fourier transforms of z(t), z(t) and y(t). 


From Eq. (4.16), we get 
Zw) = — { w Xv) Mw) } — 2{w XW) }e (wMw)} — Xe {wu Yu}. (4.18) 


Equating the RHS of Eqs. (4.17) and (4.18) gives us the following equation which should explain the 
transition from line two to line three in Eq. (4.13). 


uP {X(w) * Yw)} = {w? X(w) * Yw)} + 2 {w X(w)} © {w Yu)} + X(w)» {w? Yw)} (4.19) 
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4.5.2. The Average Power Spectrum Definition 


Consider a random process z(t) with autocorrelation function F_(t,t+7). As in Eq. (4.3), we define the 


average autocorrelation function of z(t) as 


1 /T 
R() = lim = [ _Rj(tit+2) dt. | (4.20) 


The Fourier transform of R (7), denoted by 5(»), is called the average power spectral density of z(t). 


We shall now show that 5 (w) does represent the average power contained in 2(t) as a function of 


frequency. 


Proof : 


If z(t) is passed through a linear system with transfer function H(jw), then the average power spectrum 
of of the output y(t) is related to the average power spectrum of z(t) as [14] 


Bo) = (Hw)? F(a) . (4.21) 


We choose the linear system to be a very narrow band filter around a particular frequency Wy: In 
particular, we choose H(jw) such that 


1/Aw Ju—w,| << Aw/2 
Gay? — { | (4.22) 

0 otherwise 

Now, the total average power in y(t) is given by 
1 7 1 rT 
i ad . t =_ } — t,t dt 
fim 3p f EW} dt = tim =f Ren 
] co 
= Ro) = — [ 5 (w) do 
2 s : (4.23) 
Fes = (wy) : 


We can make the narrow band filter as narrow as we wish, and by doing so we can extract the average 


power of z(t) at w,. This implies that S(w) is indeed the average power spectral density of the random 
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process z(t). This definition of the average power spectral density as the Fourier transform of F(r) is the 


so called ensemble average definition. When measurements are made in practical systems what is 


normally available to us is only one sample realization of the random process z(t). In this case we obtain 


an estimate of the average power spectrum of z(t) as 


T 
Sv) me = [ lt) e ™* at |? 


(4.24) 


We shall show that the above estimate for the average power spectrum is unbiased in the limit as 


T — o0, i.e., that 


Ree E{ Si) } = 5 (w) 


We proceed as follows 


r | rl, 
E( 3{v)} = B= [ atu) eF™ du [a0 ae} 


Therefore, 


lim E{ 3 {w) li = | [me ~IA"—*) dy de 
Tc CS yl) } = Tc 2T be® we . 


1 /T /T 
a =. i. Rit,u)e“**—9) du de 


Also from the ensemble average definition of 5»), we get 


5 (w) = f. R (1) e~*" dr 


[ me Any cee . dt e"*" d 
im — t,t+r) dt e” T 


1 T 
lim — 
T+ 0 2T Jta—T) r=—co 


Rit,t-+1) e~™" dr dt 


(4.25) 


(4.26) 


(4.27) 
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a [ J ; ~ R(t,u) e~("—) du dt (4.28) 
= n =< ju) ¢ u ‘ . 
T — ©6O 2T te==—T J y==—00 


Comparing Eq. (4.26) and Eq. (4.28), we see that in the limit as T —+ 00 they are equal, i.e., 


z ; 1 T — jut 0 
(2) = tim SEI [ a(t) eat |? (4.29) 


The above equation is the definition for the average power spectral density that we used in Eq. (3.2) of 
Chapter 3. Now, even though the time average estimate for the average power spectrum in unbiased, it 


is not conststent, i.e., the variance of the estimate does not tend to zero in the limit as T —+ oo. In fact 


we can easily show that 


jim Var{ Su) } = [35(w)}. (4.30) 


Most spectrum analyzers estimate the average power spectrum of a random process by computing S{(w). 
For stationary processes we could reduce the variance of this estimate by splitting the time interval into 
nonoverlapping intervals and averaging over the estimates obtained in each interval. For a 
nonstationary process, however, we cannot do this, because in this case the nonoverlapping intervals can 
have entirely different statistics. Hence, in order to reduce the variance of the estimate we need to 
artificially introduce cyclostationartty or periodicity into the random process. This is essentially the 


reason for considering periodic written bit sequences when we measure the average power spectral 


density of media noise. 
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Figure 4-1: Power spectral density of background noise 
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Figure 4-2: (a) Readback voltage for a digitally recorded 1 MHz square wave, 
and (b) corresponding flux waveform 
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Figure 4-8: Modulation noise, before division by w*, measured around 
the center frequencies of : (a) w == 500 kHz, (b) w = 1 MHz, 
| (c) w = 1.5 MHz and (d) w = 0 ( dec. ). 


44 


S,, (w) * Flu) dB ( Volt?-sec?/Hz ) 


-230.0 


-240.0 d 
) a 
N 
-250.0 NY, \ 
te " a 
WK ic 
-260.0 my we’ 
} c Men 
te ry" 
I Ps, i, ! un 
f i Eat 
-270.0 | sen 
-280.0 
-600.0 -400.0 -200.0 0.0 200.0 400.0 
of (me w/2e) KHz 
Figure 4-4: Modulation noise, after division by w, measured around 


the center frequencies of : (a) w = 500 kHz, (b) w = 1 MHz, 
(c) w = 1.5 MHz and (d) w = 0( dic. ). 
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Figure 4-5: Autocorrelation functions of : (a) Modulation Noise, 
and (b) Background Noise 
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Chapter 5 


Bit Detection Schemes For 
Erased Signals in Noise 


5.1. Introduction 


In Chapter 2 we developed an approximate model for the signal pulse that represents a digital "1"; the 
approximation made was that the corresponding flux waveform has an arctangent shape. In this chapter, 
we shall extend the signal model and flux model to include the case of d.c. and a.c. erased signals. Also, 
in Chapter 4 we developed a general time-domain model for particulate media noise which allowed us to 
express its two-dimensional autocorrelation function in terms of the one-dimensional autocorrelation 
functions of two stationary processes. We shall approximate the one-dimensional autocorrelation 


functions by reasonable analytic functions in this chapter. 


Using these models for signal and noise, we can analyze the performance of various bit detection 
schemes, both optimal and sub-optimal, which can be used to detect the signal pulse in the presence of 
noise. The criterion that will be used to compare these detection schemes is the probability of bit error ( 
or the bit error rate ) that results when these schemes are employed. The bit error rate for a given 


detection scheme increases with the level of erasure. This dependence will be studied for both a.c. and 


d.c. erasure. 


In order to facilitate the calculation of the error probability we use a discrete-time version of the 
readback voltage for analysis. The readback voltage in a given bit period is discretized into N samples 
which form a vector r. The Lorentzian pulse which represents a digital °1" is discretized into N samples 
that form a vector s. This pulse could be either a positive-going pulse or a negative going pulse as we 
saw in Chapter 2. We shall be assuming that we know the sign of the pulse beforehand. This assumption 
has certain implications, when we consider estimation of a sequence of bits using a bit-by-bit detection, 
which we shall discuss in the next chapter. The two-dimensional autocorrelation function of the noise is 


discretized to form an NX WN matrix called the covarstance matrix. 
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We shall consider four bit detection schemes in this chapter. The first of these is the optimal bit 
detection scheme, i.e., the scheme which yields the lowest bit error rate. We shall show that this 
detection scheme is quadratic, and hence difficult to implement. Next, we shall assume that the detector 
is linear and find the best linear detection scheme. Then we shall consider a very simple linear detection 
scheme called the Correlator. All of the above detection schemes assume a knowledge of the exact nature 
of the signal and statistics of the noise, and are hence referred to as parametric detection schemes. 
Nonparametric detection schemes, on the other hand, require only a partial knowledge of the signal 
shape and noise statistics. We shall consider one such nonparametric scheme called the Stgn Detector. 
For the three parametric detection schemes, we shall obtain analytical expressions for the error 
probability which can be evaluated numerically on a computer for specific test cases. This will serve to 
compare the performance of these detection schemes. For the sign detector, however, it is very difficult 
to obtain an analytical solution for the error probability, and we have not attempted to obtain one. 


Hence, only the detection strategy has been presented in this case. 


5.2. Analytical Models for Signal and Noise 


5.2.1. A Model for the Erased Signal 


We shall assume that erasing the medium causes only the amplitude of the signal to decrease, while 
maintaining the shape ( equivalently the frequency content ) of the signal. This is an approximation 
because in most practical systems erasure causes some distortion in the signal shape. Also, in deriving 
the arctangent model for the flux in Chapter 2, Eq. (2.19), we assumed that the recording was done at 
saturation so that the d.c. value of the flux was sero. This condition of zero d.c. value is not valid 


under conditions of d.c. erasure. Hence, we generalize Eq. (2.19) as follows. 


29¢ 


m t T T 
d(t) 7 tan (55) + ?4. ’ ~ 9 <t <3” (5.1) 


where ¢ is the saturation value of the flux, g is a dimensionless quantity representing the level of 
erasure, JT is the bit period, pT is the transition width parameter, and ¢ de 8 the d.c. value of the flux. 
The quantity p is dimensionless and it represents the normalized transition width parameter, the 
normalization being done with respect to T. The quantity q can take on values in the range [0,1]; g = 1 


represents no erasure, and g = 0 represents complete erasure. 


Under conditions of a.c. erasure, the flux does not have a d.c. component. This condition is described in 


Fig 5-1. In this case, we can write the flux as 
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ee 


m -_1, T 
OA oe ten (oF) a a (5.2) 


tw 


Under conditions of d.c. erasure, ¢(t) is saturated at either +¢ or —?¢,, The d.c. value of ¢(t) in this 


case is determined by both ¢ and the level of erasure g as is shown in Figs. 5-2(a) and 5-2(b). Hence, 


we get 


2 t T T 
¢,(t) = én [= tan? (=) a i n «| ; Pig t<-. (5.3) 


The readback signal in both cases is given by the familiar Lorentzian pulse shape, shown in Fig. 5-3, 


which can be written as 


; dy, 2% m 1 Bt 


If we normalize the flux with respect to @ and the time variable ¢ with respect to 7, then we can 


rewrite Eq. (5.1) in terms of only dimensionless variables‘ as 


2q cts. . 1 1 
t) = —tan “(- ; —--<t<e-. 5.5 
(0) = Stan (=) + dy, 7 St <5 (5.5) 
We also get the following equations as a result of this normalization. 
2q 1,¢ 1 1 
t) = —tan “(- ; —-<t -, 5.6 
,<¢) = Stan (=) -<t<z (5.8) 
2q. .1,%, 4 1 1 
t) = —t - i-<= ig, —--<t<-. 5.7 
d 2¢ 1 1 1 
a(t) = —o(t) = —| ——_||_ ,, —--<t<-. 5.8 
slew tach | Pee ed = St<;5 (5.8) 


Eqs. (5.6) to (5.8), which are expressed in terms of only normalized variables, will be used in the 


numerical computations of bit error rates. 


‘Strictly speaking all the normalized quantities which we shall be introducing in this section should be represented by different 


symbols from those we have used for the unnormalized quantities. The fact that we have not done so should not confuse the 
reader. 
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Figure 5-1: Flux waveform for a.c. erasure 
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Figure 5-2: Flux waveforms for two cases of d.c. erasure ; 
(a) saturated at +¢_, (b) saturated at -¢.° 
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Figure 5-3: Lorentzian signal pulse for an erasure level of g 
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5.2.2. Noise Model 


To characterize the noise we use the time-domain model that we proposed in Chapter 4, i.e., 


alt) = Sn) o()} + ng(t). (5.9) 


As discussed in Chapter 4, the autocorrelation function of n(t) can be obtained from the autocorrelation 


functions of the background noise, n,(t), and the modulation noise, n,(t), as 
R,(tt+7) = R, (7) oft) e(t+r) + R,“() o(t+r) 6(t) 
— R,"(1) $(t) o(t+7) — R. Mr) o(t) o(t+7) + R, (7), (5.10) 
1 1 0 
where superscript ’ stands for differentiation with respect to 7. 


In order to simplify the analysis we shall assume that both the modulation noise as well as the 
background noise terms are band-limited white. If B, and B, are, respectively, the bandwidths of the 


modulation noise and background noise (B, > B, ), then their power spectral densities can be written 


N, 
—_ —2nB, Sw <2nB, 
s,(0) = { | 
1 0 otherwise 
No 
— —27B, < w <27B 
S (wv) = { 2 . : (5.11) 
0 0 otherwise 


From Eq. (5.11), we can compute the autocorrelation functions R, (r) and RF, (r) by using the inverse 
1 0 


Fourier transform to get 


1. 
R, (1 = (20B,7) , 


No 
= — gj : 5.12 
R,, | t) 35, om (27B,7) (5.12) 
If we set 
f, = BT and f, = BT, (5.13) 


where f, and f, represent normalized bandwidths, then 
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T 
R, = sinc( 2f, 7) ; 
Nolo . T 
F,, = ar sinc( 2fo 7) ; (5.14) 


where the function sinc(-) is defined by 


sin (72) 


sinc(z) = ae (5.15) 


By definition, FR, (7) is dimensionless, and FR, (r) has the units of square-volts. Hence, in order to 
1 0 


normalize all the variables in Eq. (5.14), we normalize r to T and R, to ¢-/T. This procedure for 
0 


normalizing the autocorrelation functions is consistent with the normalization of Eq. (5.5). After 


normalization we obtain 


R, &) = —— sine(2f,7) =f, sinc(2f,7) ; 


Nofot 
: sinc(2f,r) = r, sinc(2f,7) , (5.16) 


m 


R,( = 


where r, and r, are dimensionless variables representing, in some sense, the noise-to-signal ratios of the 


background and modulation noise respectively compared with the peak power in the unerased signal. 


In order to compute F(t,r) using Eq. (5.10), we need the first and second derivatives of R, (7). We 
1 


compute these as follows. 


d ein (27f,7) 


U eam ES 
ns () = art "1 Qnfit 
"4 se 
os (2nf,r) - — sine(2J,7) (5.17) 
R. tek as d ‘ r, cos (27f,7) _ : r, sin (27,1) 
1 Tr r Qnf,r 
2r, 2r, ae 
= ne fn) - By os Onhys) — r,(2rf,)° sinc(2f,r) . (5.18) 


We mentioned earlier that we would be using a discrete-time version of the readback voltage for the 


detection problem, which would require the use a matrix called the covariance matrix to describe the 
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two-dimensional autocorrelation function of the noise. Let us choose the N elements of the vector r 


representing the readback voltage as 
r= r(t.) : l<ez < N. (5.19) 


where 


dh : T 
(oS G-I) 5; (5.20) 


The above definition for the elements of the read back voltage vector implies that the signal pulse 


representing a digital °1" should be discretized to form a vector s whose N elements are given by 


8. = a(t), 1<icN. (5.21) 


The corresponding noise covariance matrix is obtained from Eq. (5.10) by choosing the (#,7)th element of 


the covariance matrix J as 
25, = R,{tpt) l<t KI N,LCSIJKN. (5.22) 


It is important to note here that the noise covariance matrix as defined above is going to be different for 
a recorded "1" and a recorded "0". In fact if we denote these two matrices by 2 and 2, respectively, 


then the (f,7)th elements of these matrices are given by 
Xo; j= Rt; —t), a a.c. erasure , 
xo; = R, (5 - t) - ¢, R(t, —t), for d.c. erasure , 
s 
Zi; = R, (5 - t ) a(t.) a(t Py + R,. (t;— t.) p(t ; a(t.) 
— RF, “(e;— 4) ot) ot) — BR, “t;— t) ot) ot) 
+ FR, (t.-t), lst i N,LSIKN. (5.23) 
o7 
From the above equations it follows that 2, and 2, are symmetric matrices. Furthermore, 2, is a 
Toeplitz matrix, i.e., a matrix which has equal values on each of its diagonals. The covariance matrix a) 


depends on the signal and hence changes with level of erasure. It can be seen that if we reduce the signal 


level to zero both matrices will be equal, as expected. 
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5.3. Bit Detection as a Hypothesis Testing Problem 


The problem of detecting the signal pulse that represents a "1", in a given bit period, in the presence of 
noise can be viewed as a hypothesis testing problem. The two hypotheses under consideration are x, ( 
signal present ) and NW, ( signal absent ). Since we are dealing with a discrete-time version of the 
problem, the readback voltage in a given bit period is discretized into N samples which form the vector 
r. If there was no noise, r would equal s under hypothesis X,, and equal O under hypothesis X,- In this 
case we would not have any problem deciding whether a "1" or a “O" is written in a given bit period. 
Now, if we add noise to the readback voltage, there is a possibility that a "1" might be interpreted as 
"0" or vice-versa. This constitutes an error in detection. Intuition would have it that increasing the 
power in the noise with respect to the signal power would increase the probability of error. The goal of 


hypothesis testing is to evolve a scheme based on the statistics of the noise which would minimize this 


error. 


We mentioned in Chapter 4 that a Gaussian random process is completely characterized in the 
stochastic sense by its mean function and its autocorrelation function. The discrete time equivalent of 
this statement is that a Gaussian random vector is completely characterized by its mean vector and its 
covariance matrix. The random vector r has different statistics under each of the hypotheses. If H, and 
#, represent the mean vectors of r under the two hypotheses, and 2, and 2 are the corresponding 


covariance matrices, then we can write the following equations which define them. 
Hy = Efr/¥), 
Bw, = E{r/X,}, 
Ey = Ef(r—1,) (r—1)"/Hp} » 


B= El(r—n,) (r—-2,)7/%,} (5.24) 


where, superscript T denotes the transpose operation. 


For our detection problem, a "1" is represented by the signal pulse vector s and a "0" is represented by 


no pulse or an all-zero vector. Hence, 
Hy = 0, 


HW, = 8. (5.25) 
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With the above definitions, we can write the following equations for the joint density functions of the N 


components of r under the two hypotheses as given below [14]. These joint density functions completely 


specify the statistics of r. 


p(r/H,) = (5 (rs) 2,7 (r-8)} 


1 
(2n)N/? Zp" 


p(r/H) = exp{— - r? a r}, (5.26) 


1 
(27n)N/2 [Z, [1 
where |-| denotes the determinant of the matrix within the vertical bars. 


Now we are in a position to write an expression for optimal detection scheme. Using Bayes criterion, 


the optimal detector, i.e., the detection scheme with smallest probability of error, can be written as [5] 


ney LM) POD) x, 
) = FEM) < P(X) = rel, (5.27) 


where, P(X) and P(XH,) are the a priori probabilities of hypotheses X, and H, respectively. In the 
absence of any information about the hypotheses, both these a priori probabilities are taken to be equal, 
in which case the RHS of the above inequality would equal 1. 


In words, what the above inequality says is that when the LHS of the inequality is greater than the 
RHS, we decide that r came from X,, i.e. we detect a "1". Similarly, when the LHS is less than the RHS 
we detect a "0". It can be easily shown that this scheme does indeed yield the lowest probability of error 
[5]. The term (U(r) is called the likelihood ratio and is the basic quantity in hypothesis testing. 


Sometimes it is convenient to use the logarithm of the likelihood ratio instead of the likehood ratio 


itself. Then the decision rule becomes 


P(X) , 
h(r) = In {i(r)} a In lean] = H — ret (5.28) 


The RHS of Eq. (5.28), represented by H, is called the threshold of the detector. If the A(r) is greater 


than H, we detect a "1"; and vice-versa. Using Eq. (5.26) we can write the log-likelihood ratio as 


by 
h(r) = =In it et ie ee * (rs)? 5? (r—s) . (5.29) 
2 |f| 2 ~° 2 i 
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For a given detection problem, 2X» x) and s are known. Hence, if we are given a particular readback 
voltage vector r, we can decide whether it is a "1" or a "O" by first computing A(r) using Eq. (5.29), 


and then using the decision strategy of Eq. (5.28). 


The probability of error that results when the above decision strategy is used obviously depends on Xo» 


2 and s. We analyze the following two cases in an attempt to find an analytical expression for this 


error probability. 


5.8.1. The Equal Covariance Matrix Case 


We first consider the simple case when the covariance matrices under the two hypotheses are equal, i.e., 


Da 


o™= 2) == 1’. This is true when modulation noise is absent. Even though it is not a realistic case 
study when we have modulation noise, the results we derive here will be used for later comparison 


studies. In this case the log-likelihood ratio ratio reduces to 
ees | let 5-1 
h(r) = w Dr — 3° Pe (5.30) 


Since h(r) is produced by a linear transformation on r in Eq. (5.30), it is also a Gaussian random 
variable. Let n, and 9, denote the mean values of h(r) under the two hypotheses, and O° and o,” denote 


the corresponding variances of h(r). Then, we can write the following equations for these quantities. 
ee es 
ng = E{h(r)/M} = —5eT Ee = -n, 
ay sro 
n = Efh(e\/¥} — se De = 
0° = E{fh(r)—0,)°/X} = a Do's = 20, 
0? = E{{h(r) — 9,)7/X,} = oT Ds = 2. (5.31) 


Now, since h is Gaussian, we can write the probability density functions of h under the two hypotheses 


in terms of only means and variances as 


p(h/X,) = 


p(h/X,) = ) (5.32) 
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If we now consider the detection strategy specified by Eq.(5.28), there are two ways in which we can 


make an error. The first is when r actually belongs to H, and the log-likelihood ratio h is greater than 


H. The probability of this type of error is represented by €,. The second type of error is when r belongs 


to x, and h is less H. The probability of this type of error is represented by é): Hence, we can write the 


following equations for €, and €,. 


a OL ALLE 


H 
= f p(h/¥,) dh . 


These error probabilities can be written more explicitly as 


| (h+n)* 
Q = I Ja a” 


oo 1 
— Se —_ wa d 
(ntE/Vin Joe a 
(n+H ) 


where erf{.} is defined by 


| 1 fF y 
erf(z) = i [ exp(— 2) dy 


and = erf(—z) = — erf(z), z € [—co,0o] . 


Similarly 


a ee MOL AL 


(5.33) 


(5.34) 


(5.35) 
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00 1 y 
=] — ——— ——)d 
haat Jon =P 2 ) . 


1 —H 
= 5 + enf{ nt) ‘e (5.36) 
von 
The total] probability of error , €, can be obtained from €, and €, as 
e = €, P(M,) + €) P(X). (5.37) 


We see that in the equal covariance matrix case the calculation of error probabilities is a fairly simple 
task. The optimal detection scheme in this case consists of computing a linear transformation of the 
readback voltage, as given by Eq. (5.30) and comparing the resulting quantity with a threshold. The 
continuous time equivalent of this strategy would involve passing the readback voltage through a linear 
system and comparing the output of the linear system at a specific time instant with a prespecified 
threshold. This linear system is called a matched filter and is discussed in great detail in communication 


theory literature [19]. We now consider the more interesting case when the covariance matrices are not 


equal. 


5.8.2. The Unequal Covariance Matrix Case 


In the detection problem of interest, when the noise is signal dependent, the covariance matrices of r 
under the two hypotheses are unequal. In this case the log-likelihood ratio cannot be reduced to a linear 
transformation of r as we did in the equal covariance matrix case. Hence, h(r) is no longer Gaussian, 
and finding the probability density function of h under the two hypotheses is a much more difficult 
problem. We first simplify the detection problem a little in the following way. 


Since r has a normal distribution, we can always find a linear transformation which diagonalizes both 
covariance matrices, x and 2) simultaneously. Such a transformation matrix, A, is one which contains 
the etgen-vectore of a 2, as its rows [5]. If we pass the random vector r through this transformation, 


the random vector that we obtain at the output, y, has components which are independent random 


variables under both hypotheses. 


y = Ar. (5.38) 


The mean vectors and covariance matrices of y under the two hypotheses can be shown to be [5] 


E{y/X,} =AO= 0, 
E{y/H,} = As =d, 
Cov{y/X,} = ALA’ =I, 


Cov{y/N,} = AD, A™ = A, ; (5.39) 


where A is an NXN diagonal matrix containing the etgen-values, Ayre Aap of ae 2X along its 
diagonal. 


The log-likelihood ratio, in this case, has the form 


My) = —SIn{lal} + 3 {¥7y — (ya) a7 (ya) } (5.40 


It is easily shown? that the expressions for the log-likelihood ratio in Eq. (5.29) and Eq. (5.40) are 


equivalent. This means that the transformation, A, preserves the optimality of the detection scheme of 


Eq. (5.27). 


By diagonalizing both the covariance matrices of y, we ensure that the N random variables, YpreoYap 


which constitute y, are independent. Hence, 


N 
py/X) = ]] rly/N), &=0,1. (5.41) 
I=} 


If we define the log-likelihood of the /-th component of y as 


py,/H,) - 
My) = In TM)’ (5.42) 
then 
N 
h(y) = >> Aly). (5.43) 
i=} 


Our goal is to first calculate p(h/X,) and p(h/H,), from the density functions of the A(y,) s; and then, 
use these to calculate the total probability of error, «. There are two approaches to this problem. The 
first involves a rigorous, exact calculation of €; and the second is an approximate calculation of ¢ which 


is valid for large N. 


BSc the Appendix section at the end of this chapter 
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5.4. Exact Calculation of Error Probability 


Here we use a characteristic function approach to find the desired density functions of h. The 


characteristic function of a random variable® , z, is defined by 

b,(0) = E{exp(juz)} . | (5.44) 
For y € x. , =0,1, the characteristic functions of h(y), ¢ {w) are defined by 

¢{w) = Ef exp( xuh(y) )/H;} . | (5.45) 
Similarly, the characteristic functions of A(y,) for y € XN. , are given by 

6,0) = El exp( suh(y) )/H) . (5.46) 


From Eqs. (5.43), (5.45) and (5.46), we get 


N 
¢{o) = JT 4,(x). (5.47) 
luz} 
From Eq. (5.47) we get the following two equations for the absolute value and argument of ¢ {w) 
N 
lt(o)| = TT 6), 
l=] 
N 
£${w) = > L$ ,{w) . (5.48) 
bax] 


Hence, if we can find the absolute values and arguments of ¢ {w), we can use these to obtain the 


characteristic functions of h under the two hypotheses. 


From Eq. (5.39), we see that each of the components, y,, of y is a Gaussian random variable with, mean 


O and variance 1, under 0 and with, mean d, and variance ,, under H,- Hence, 


1 vy 
Ply,/ A) — Tee exp(— 3) ? 


(y,—d,)” 
exp(— 


1 
P(y,/x,) = 
aa Vim, 2h, 


). (5.49) 


Owe have used the symbol ¢ to represent flux earlier on in this chapter. 
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Substituting the above density function in Eq. (5.42), we get 


1 
hy) = —5Ind, + }. (5.50) 


Proceeding from Eq. (5.50), we can show that’ the absolute value and argument of ¢ {w), ¢ = 1,2, are 


given by 
2 2 
1 byw 
I6,{4)| = —>—, xe |-— ]. 
: (14+w*a..?)!/ : 2(1+w%a ,”) 
2 2 
24,40) = LeasYo,o) - Su [ hy +> | 
AW) == =—tan ‘(a. —- -W t—_— ll. 5.01 
$ 9 it” 2 sl 1+u"a., ( ) 
where 
; 1 
a, =1-—, 
a, 
awe 
ao )? 
d; 
2 
boy 
hy, = aps + Ind,, (5.52) 
ol 
and 
a, = r, -— 1, 
1/2 
= al dq, 
2 
h ar 5.53 
u l+a,, ve oP) 


Hence, using Eqs. (5.51) and (5.48), we can compute the characteristic functions of h under both the 
hypotheses. Our goal is to make use of these characteristic functions to determine the total probability 


of error, €. The procedure is as follows. 


"See the Appendix section at the end of this chapter for details. 


63 


From the characteristic functions, ¢ {w), we can compute the density functions p(h/X,) as 


plh/W) = oof ofu) exp(—duh) de 


Also, from Eq. (5.33) we know that 


00 H 
= | wh/y)adh = 1 - [vn an, 


H 
= / p(h/X,) dh . 
—0o 
where 


ae (X,) 
P(x,) 


_ Using a familiar rule of the Fourier transform® we get, 


¢ {0) 1 sre ¢ {w) 


H 
fo wry) dh = or weer See ae exp(—jxvH) dw . 


(5.54) 


(5.55) 


(5.56) 


(5.57) 


Now, ¢ {w) has an even real part and an odd imaginary part. Hence, the integrand on the R.H.S of Eq. 


(5.57) has a real even part and an odd imaginary part. Therefore 


pss exp(—jwH) dw = 2 [Re exp(-juHl) | dw 


00 |¢ {w)| 
se 2 | sin { £${u) — oH } du. 
0 wW 


Also, from Eq. (5.45), we get that (0) = 1 . Hence, Eq. (5.57) reduces to 


oo lo {w)| 
[ p(h/X;) dh -;--/ sin { Z¢{w) — wH } du. 
co 0 ud 


rf 


BSee the Appendix section at the end of this chapter for a derivation of this result. 


(5.58) 


(5.59) 
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Substituting Eq. (5.59) in the two equations of Eq. (5.69), we get the following analytical expressions for 


Ep and €,: 


co [¢,(w)| 
= - + - [ : sin { 2¢,(w) — wH } du. (5.60) 
1 1 sro ld, (~)| 
°C =5 - | sin { 2¢,(w) — wH } dw. (5.61) 


In any practical case the above indefinite integrals have to carried out numerically. Fortunately, the 
integrals are 1-dimensional and hence the integration is manageable. Because the integrals are indefinite 
there are two factors which determine the accuracy of the result; the sampling interval] and the total 
length of integration. For small error probabilities, the accuracy required in the computation of the 
integrals may be so high that it is not practical to go through the process. In such cases we may have to 


depend on an approximate calculation which is less tedious. 


5.5. Approximate Calculation of Error Probability 


In Eq. (5.43) we saw that the log-likelihood ratio of A(y) can be written in terms of the log-likelihood 
ratios h(y,) of the components of y as 


N 
hy) = > My). (5.62) 


bux] 


In this method we make the following approximation. Using the Centra] Limit Theorem, we claim that, 
for sufficiently large n, h(y) is approximately Normally distributed, i.e. p(h/ H,) and p(h/X,) are 
Gaussian density functions. To recall, we defined the log-likelihood ratio as 


My) = — Sin {ll} + 5 y7y — (ya 7 (ya) }. (5.63) 


In order to compute the error probabilities we need to know the means and variances of h under the two 
hypotheses. If 7, and 9, denote the means of h under the two hypotheses, and a and o,’ denote the 


corresponding variances, then we can show that” these are given by 


Mee the Appendix section at the end of this chapter for details. 
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1 1 1 
"= — in{l4l} + - r{I—A"} 54 Ad (5.64) 


1 1 1 
T 
—— =—tr{A— = : 

uP 5 intl} + = r{A—I} + 54 d (5.65) 
1 3 a 

oj = 5 r{(l-4 2) 4 at an? a (5.66) 
: . 

7 = 5 f(A — 1)"} + dad, (5.67) 


where tr{.} denotes the trace of a matrix. 


With the Gaussian assumption, we can write the probability density functions of h under the two 


hypotheses in terms of the means and variances defined above as 


l (h—n,)° 
p(h/.) = exp(- ——), 
2n % 205 
(h—n,)? 
p(h/X,) = exp(— 7): (5.68) 
2r o; 20, 


The probabilities of error under the two hypotheses, ¢€, and €, can hence be calculated in the same lines 


as we did in the equal covariance matrix case. 


co 1 H—n 
6 = [ p(h/H,) dh = ait 


" : a 69 
= | = = —e | 5. 
4 [ wo MN/H) dh = 5 + ert ——} (5.69) 


The Gaussian approximation that we made in the beginning of this section requires two conditions to be 
correct. The first is that the component random variables in Eq. (5.62) must closely resemble each other 
and the second is that N must be fairly large. The first condition is not met when the covariance 
matrices are very different from Toeplitz. If the contribution of the modulation noise is significant, 2, is 
going to be very different from Toeplitz as Eq. (5.23) indicates. Hence, we might get misleading error 
probabilities using this method even when WN is large. In the next section we discuss one way of 
obtaining an upper bound on the error probability which may prove to be a better method of estimating 


error probabilities when the modulation noise is large. 
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5.6. Upper Bounds on Error Probability : The Chernoff Bound 


It is evident from the discussion in the last two sections that calculating error probabilities in the 
unequal covariance matrix case is, in general, a difficult task. Even when the observation vectors have a 
Gaussian distribution, we must resort to numerical integration of some form. The approximate 
calculation described above is one way to get around the problem. Another way is to seek an easily 
computable expression for an upper bound on the error probability, because in many practical cases the 


upper bound is all the information we need. 


One such set of upper bounds on the error probabilities €), €, and ¢ are the Chernoff Bounds [5] which 


can be represented by the inequalities 


P(X;) 1 
& S$ lean)” exp -u(5)], 


P(X) 1 
Le |” eel-a(5)1. 


€ 


lA 


1 


eS [P(M) PO)? el -u(5)], (5.70 


rr 


] . 
where ju( r ) is the Bhattacharya [3] distance between the two hypotheses, defined by 


1 1 
Ws) = se7[ Jn + Sm + D2) — Tm CIB IDI}. (5.71) 


It can be shown that [5] the Chernoff bound is very close to the exact error probability when the error 
probabilities are very small, but could be very different from the exact value when the error 
probabilities are large. This fact is very useful because, as we shall see in the last section of this chapter, 
numerical computations of the exact probabilities are much simpler and more accurate for large error 


probabilities, and we would not have to use the bound anyway. 


5.7. Linear Detection Schemes 


From Eq. (5.29) we can see that the log-likelihood ratio h(r) for the optimal] detection scheme is 
obtained from r by a quadratic transformation of r. This means that if we implement the optimal 
detection scheme by passing the readback voltage through a filter to obtain the log-likelihood ratio, the 
filter that would have to be used would be a quadratic, i.e., non-linear, filter. Implementing a non-linear 


filter is, in general, not an easy task. In this section we shall approximate the log-likelihood ratio by a 
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linear function of r, and find the coefficients of the linear transformation vector which minimize the 


error probability when this detection scheme is used. We write h(r) as 
h(r) = Vir + > (5.72) 


where V is column vector of size N, and U, is a constant. 


The decision strategy that we use now is 


T > x) 
A(r) = Vor +u 2 0 ~re{'. (5.73) 
0 


Since r is normally distributed, h(r) as defined above is also normally distributed. Hence, in order to 
obtain the probability density functions of h under the two hypotheses, we need to calculate only the 


means and variances of h under the two hypotheses. These are given by 


Ny = E{ h(r)/X,} = V7 E{e/X,} + = v, 


n, = E{ A(r)/X,} = V7 E{ r/X, } + u = Vis + Uy » 
P= WE rr j}V = VV, 
2 = V7 E{ (rs) (r-s)’ }V = VZV. (5.74) 


It is easily shown that the total probability of error that results, when the decision strategy of Eq. (5.73) 
is used, is given by 


2 —n,/o 2 
c= POD LT , ily see a + PH) f” a exp(——)dh. (5.75) 
0 Tv 


Our goal is to find the coefficients of the linear transformation that minimize the above probability of 


error. Hence, we differentiate € with respect to V and vu, and set the resulting quantities to zero. 


7) 1 m9 1 1 8 No 
€ 
— = P(X) — en -—5) = [-—]- PX) = er -—) & [-= 
dav * Joy 20,” aA —t aay 20,” a -) 
1 n - Sls. )2,V 
= P(.)— nc ec tae 
as tha o,? | (VEvy?” (v7 5, ve? 
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P(,) —= exp( - —) —— 
Von 20,7 (vo, Vv)? 
, 2 
= — P(X,) xp(-—z)[s —2,Vv] 
o 27 1 1 
2 
1 No 0 
— P(X.) exp(- 5) — 2, V = 0. (5.76) 
0 Van 20, % 
and 
2 
o F,) " 
Bm rayon 2-8] 
27 1 % °; 
2 
F,) ") 
— P(X) —= ex -— 5) —[-=] 
2r m ad) 
2 
1 ” 1 . 
= — P(X,) exp( - — ) + P(X) exp( - —— ) = 0. (5.77) 
o,Van 20, o Van 205 
Using Eq. (5.77) in Eq. (5.76), we get 
No 1; 
-=%v -[*-—2,v] =o 
% 7} 
1 No 
i.e., s = Pa 7 = Xo | Vv. (5.78) 
1 0 
Also, rewriting Eq. (5.77) we get 
2 2 
FW, = (5) = AM) = ol - 5) (5.70) 
, exp(i-——) = A exp(—-——). 
o Van 20,” o Van 205° 


Solving Eqs. (5.78) and (5.79) yields V and vu, which minimize the probability of error ¢. Unfortunately, 
an explicit solution to the above equations is very difficult to obtain, and hence we need to use an 


iterative procedure to find the solution. 


A simple iterative solution to this set of equations was first suggested by Peterson [15]. Instead of 
solving Eqs. (5.78) and (5.79) directly, the minimum of ¢ is sought under the conditions of Eq. (5.78) as 


follows. 
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V = a[ad, + (1-—a) ZL, J's, (5.80) 


where 
2 2 
1 1, "No (5.81) 
earn aa : 
“7; % | 
and 
2 
No/% 


ws «Ea Re ETE EE ERE RETR * (5.82) 
2 2 
( n,/e, = No/%> ) 
. T 
Now, since n, = vu, and "|= V' s+), and a oy n+ (1—a) o,” Nn, = 0 from Eq. (5.82), uv, can be 
calculated as 


a o,. vi s 


0 2 2 
ao, + (l—a)o, 


From Eq. (5.83) we can see that if V is multiplied by a, U% is also scaled by the same factor a. The 


decision made by Vir + U is O is the same as the decision made by a Vir +av, 7 0. Hence, € is 


0< 
invariant under the scale change. Therefore, by ignoring the scale factor of a, we can plot € as a function 


of one-parameter a as follows. 


e Calculate V for a given a with a = 1. 
e Using this V, calculate oy" o,”, V7s and Up: 
e Calculate € using Eq. (5.75). 
e Change a from 0 to 1 continuously. 
From this plot, we can find a for which ¢ is minimum. We then use this value of a to calculate V and 


v, for the linear detection with minimum probability of error. 


8.7.1. Correlation Detector 


A special class of the linear detection schemes is the Correlation detector. The correlation detector has 


V and U% which are given by 


V=s, (5.84) 
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P(X)) 
ler 0 ) 
Ur 5 8 © — BOD (5.85) 


The detection strategy that results from the above choice is 


P(H,) 1 M 
T oo egg, 2m i | 
ere ” POM) + 5a 8 ret (5.86) 


The LHS of the above inequality is the dot product, or the correlation, of the readback voltage vector 
with signal pulse vector. This is the reason for calling this detector a correlation detector. The 
correlation detector is very easy to implement, and is very commonly used in scenarios when the noise is 


stationary and white because it can be shown to be optimal in this case [11]. 


5.8. Nonparametric Detection Schemes : The Sign Detector 


All the detectors that we have considered so far assume that we know probability density functions of r 
under the two hypotheses. If the actual probability density functions of r are the same as those assumed 
in determining the detection scheme, the performance of the detector in terms of error probability is 
good. If, however, the actual probability density faiietions are considerably different from those 


assumed, the performance of the parametric detector may be severely degraded. 


Nonparametric det.:iors do not assume that the input probability density functions are completely 
known, but only make general assumptions about the input such as symmetry of the probability density 
function and continiuity of the cumulative distribution function. Since there are a large number of 
density functions which satisfy these assumptions, the density functions of the input may vary over a 
very wide range without altering the performance of the nonparametric detector. Of course, the 
performance of the nonparametric methods will be inferior to the parametric optimal detection schemes 


when the statistics are completely characterized. 


We shall consider one such nonparametric detection scheme in this chapter, namely, the stgn detector 
[6]. The sign detector utilizes only the polarity of the data to make its decision. We assume that the 
signal takes on positive values, i.e., all the components of s are positive. We have seen earlier that in 
digital magnetic recording the signal pulse is either a positive going pulse or a negative going pulse. 
Since we assume that the sign of the pulse is known before hand, the analysis that we do for positive 


pulses can be easily repeated with minor modifications for negative going pulses as well. We also make 
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the assumption that the input data vector r has components which are statistically independent. With 


this assumption we can write the probability density functions of r under the two hypotheses as 


N 
pr/X) = TI] plr,/%,) 


lez} 


N 
p(r/X,) = J] vlr,/X,)- (5.87) 
bax] 
Since we are only interested in the signs of r,, we do not need the entire density functions of r, under the 


two hypotheses. All we need are the probabilities that r, is positive ( or negative ) under the two 
hypotheses. 


Under hypothesis Xo» since we have only noise which is zero mean, each of the r,s is equally likely to be 


positive or negative, i.e., 


1 
P(r, 2 0/Mg} = 1 - Pr, < O/H} = g. (5.88) 
Under hypothesis ¥,, each of the rs is more likely to be positive than negative, i-e., 


| 1 
P{r, 2 0/M,} = 1 -— Mr, < 0/X,} = p, PY> 5° (5.89) 
To recall, the decision strategy that yields the minimum error probability can be written in terms of a 


likelihood ratio and a threshold as 


Plr/H,)  P(%) x 
ir) = re 


1 
~~ : (5.90) 
Helm) < PO) ¥, 
Using Eq. (5.87) we can express the likelihood ratio I(r) in term of the likelihood ratios of r,, | = 1,...,N, 


p(r/ x) N’ plr,/ a ~ 


(r) = ae i I Ae TM) - i K(r,) . (5.91) 


From Eqs. (5.88) and (5.89) we can write I(r,) as 


2 P, for r 
Kr, a { (5.92) 
2 (1 — p,) for or, < 0 
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For a particular observation vector r, let L nm denote the set of values of ! for which r, is positive and L 


denote the set of values of / for which r, is negative. Then we can rewrite Eq. (5.91) as 
(r) = Jf 2p, JJ 20-») (5.93) 
L, L_ 


Hence, the decision strategy to be adopted is 


2 2(1 > FO) (" 5.94 
MT tem) Th e20-0)) 2 pa ety (5.94) 


Even though the above decision strategy looks fairly simple, error analysis for this detector is not easy. 
Gibson and Melsa [6] have obtained an analytical expression for the error probability when the 
components of r are identically distributed, i.e., all the p, 8 are equal. This condition is very far from 
true in our detection problem, since the signal has much larger values at the center of a bit period than 
at the extremities. Hence, we would have to either simulate the detection problem on the computer or 


conduct an experiment on real data, to find the bit error rates when the sign detector is used. 


Since our goal in this project is to obtain conservative estimates of the probability of detection of an 


erased digital signal, we will not pursue this sign detector any more. Rather, we will concentrate on the 


performance of optimal detection schemes. 


5.9. Numerical Evaluation of Error Probabilities 


5.9.1. Case Studies Considered 


In order to evaluate the error probabilities for the various detection schemes that we discussed in this 
chapter numerically, we need to assign specific values to the various dimensionless quantities that we 
defined in the first section of this chapter. These are : the normalized transition width parameter p, the 
Jevel of erasure g, the normalized bandwidth f, and noise-to-signal ratio r of the modulation noise, and 
the normalized bandwidth f, and the noise-to-signal ratio r, of the background noise. Another 
parameter that can be varied is the type of erasure, i.e., a.c. or d.c. In general, all the above parameters 
could be varied to form different combinations which could represent various recording systems. Here, 


we shall only consider a few case studies which we consider to be fairly representative. 


In all our case studies we shal] assume that 
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p = 0.2, 


f, = 0.1, 
fg = 15.0. (5.95) 


These values have been chosen to correspond to the experimental data that we obtained in Chapter 4, 
for a bit period of 0.5 p-sec as shown in Fig. 4-2(a). We can easily see that for T = 0.5 y-sec, the 
transition width parameter, pT would be 0.1 y-sec, the modulation noise bandwidth would be 200 kHz, 


and the background noise bandwidth would be 30 MHz, which is approximately what these values are 
for the data in Chapter 4. 


To study the dependence of the bit error rate on the level of erasure, we shall vary g from 1 to 10%, ive., 
vary the signa] power from the unerased level down to -80 dB, in conveniently chosen steps. Note that 
when the signal level is reduced by erasure, the modulation noise is also reduced correspondingly, but 


the background noise remains unchanged. 
The four case studies we shall consider are : 

e a.c. erasure, low modulation noise ( r,. = 10°, = 10 ) 

e a.c. erasure, high modulation noise ( r,) = 10°, = 107 ) 

e d.c. erasure, low modulation noise ( r) = 10°, i= 10% ) 

e d.c. erasure, high modulation noise ( ry = 107, r, = 107) 
The values of To and r in case studies 1 and 3 correspond closely to the values that we would have 
obtained for the autocorrelation functions described in Fig. 4-5. These values indicate that modulation 
noise level is about 20 dB below the background noise level, and hence cases 1 and 3 will be referred to 
as low modulation noise cases. For the sake of comparison, we also consider cases when the modulation 


and background noise are of the same level. These are represented in case studies 2 and 4, which we 


shall refer to as high modulation noise cases. 
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5.9.2. Software Developed for Numerical Computation 


Using the analytical expressions we developed in this chapter, we have written Fortran programs to 
numerically evaluate the error probabilities when the three detection schemes, namely, the optimal, the 
best linear and the correlator, are used for detection. For the optimal detection scheme, we calculated 
the error probability in the three ways discussed. Of all these programs, the one that does an exact 
calculation of the error probability for the optimal detector is the most tedious since it involves the 
numerical integration of two indefinite integrals. We used Weddle’s Rule [4] to compute these integrals. 
For most of the matrix operations, we used IMSL ( Internationa] Mathematical and Statistical Library ) . 
routines. Except for the program which calculates the exact error probability for the optimal detection 
scheme which took about 40 minutes of CPU time, all the other programs took about 1.5 to 2.0 minutes 
of CPU time to generate one value of the error probability on a VAX-11/750 processor. The resulting 


error probabilities for these detection schemes, for all four case studies, have been listed as a function of 


the level of erasure in Tables 5-1 to 5-8. 


As we mentioned earlier, an exact calculation of the error probability for the optimal detector involves 

the evaluation of two indefinite integrals ( see Eqs. (5.60) and (5.61) ). For small error probabilities, 
each of these integrals is going to differ from 0.5 by only a small amount. Hence, if the error 
probability is of the order of 10”, we need to have at least n decimal places of accuracy in the integral 
to get a reasonable result. We have used double precision arithmetic which provides 16 decimal places of 
accuracy in the computation of the integral. Hence, however fine a sampling interval we use for the 


integration and however long we choose our integration length, we cannot calculate error probabilities 


which are smaller than 102°. 


5.9.8. Results of Numerical Evaluation of Error Probabilities 


Tables 5-1 to 5-4 show values of the the error probability for the optimal detection scheme in three 
different ways. The Chernoff bound is in the form of an exponential and hence the exponent can be 
calculated even when the error probabilities are very small ( 10°°4" | as Table 5-1 shows ). The smallest 
value that the approximate calculation can yield is determined by the smallest value that is allowed by 
the double precision arithmetic, which is ~~ 10°°°, When the modulation noise level is high, as in Tables 
5-2 and 5-4, the approximate calculation can be so inaccurate that it yields an error probability which is 


greater than the upper bound. In such cases we have marked the value of error probability obtained by 


an asterisk. 
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Tables 5-5 to 5-8 list values of the error probability for the two linear detection schemes that we have 
considered, namely, the best linear detector and the correlator. We can see that the best linear detection 
scheme yields smaller values of error probability than the Chernoff bound in most cases. Since the 
optimal detector will perform better than even the best linear detector, when an exact calculation of 


error probability for the optimal detector is not possible, the linear detector results can be used as a 
better upper bound than the Chernoff bound. 


Tables 5-9 to 5-12 have been listed to compare the performance of the three detection schemes at erasure 


levels for which an exact calculation of error probabilities of the optimal detector is possible. We can 


observe the following trends : 


1. In all tables the probability of error increases with increasing levels of erasure. Since we have assumed 
that the two hypotheses are equally likely, the largest value that the error probability can take is 0.5, 
and this happens when the signal information is not used in making the decision, i.e., whether a bit is a 
"1" or a "0" is decided completely randomly. We see that this indeed is the case, i.e., the error 


probability in all tables approaches 0.5 at high levels of erasure. 


2. At a given level of erasure, the error probabilities for all three detection schemes are, in general, 
larger under conditions of d.c. erasure than under conditions of a.c. erasure. This is to be expected since 


d.c erasure introduces an additional modulation noise term around zero frequency. 


3. The optimal detector performs better than the best linear detector, which in turn performs better 
than the correlator, in terms of minimum probability of error at low levels of erasure, but all of them 
perform equally well at high levels of erasure. As we mentioned earlier ( See Eq. (5.23) ), high levels of 
erasure cause the covariance matrices under the two hypotheses to be equal to each other. The optimal 
detector for the equal covariance matrix case is a linear detector. Hence, the best linear detector 
performs just as well as the quadratic optimal] detector at high levels of erasure. Also, since we have 
assumed that the background noise is band-limited white noise in our analysis, the correlator, which is 


optimal for white noise, performs almost as well as the linear detector at high levels of erasure. 


4. For a fixed ry ( 107 ), changing r, from 10% to 10° does not increase the error probability very 
significantly at erasure levels greater than 20 dB. This is again because of the fact that the modulation 


noise is suppressed when the signal is erased whereas the background noise is unaffected. 
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5.9.4. Probability of Retrieving Sequences of Bits 


So far we have considered only bit error rates. Let us denote the probability of bit error by P.. From P, 


we can find the probability of correctly retrieving a bit P p 8s 


Pp=1-P. (5.96) 


Now, information is stored on the disc as a sequence of bits. This sequence could be encoded by RLL 

( run length limited ) encoding and by error correcting coding. Hence, in general, the probability of 
correctly detecting a sequence of bits depends not only on Py but also on the ascitic encoding schemes 
used. As an illustrative example of how one can obtain probabilities of estimating sequences correctly, 
let us consider the probability of detecting a "byte" ( = 8 bits ) of information when no encoding is 


used. In order to correctly detect a byte, we need to correctly detect each of the 8 bits. Hence, 


Probability of correctly detecting a byte = [ P, ah (5.97) 


For example, at the highest level of erasure ( 80 dB ), the bit error rate is approximately 0.498 for all 


case studies, i.e., 
af == 0.498 and therefore P, = 0.502. 
Probability of correctly detecting a byte = (0.502)® = 4.03 x 10°. 


We note that the probability of correctly detecting a byte is two orders of magnitude smaller than P in 


this case. If we go to longer sequences the probability of correct detection will be even smaller, though 


this value can be increased by the use of error correcting codes. 
5.10. Appendix 


5.10.1. Proof of the equivalence of Eq. (5.29) and Eq. (5.40) 


Starting with Eq. (5.40) we get, 


My) = — Zin {lal} + 5 (yy — a) 7 (a) }. (6.98) 


Now, since A is a diagonal matrix that contains the eigenvalues of x, as its diagonal elements, we 


get 
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abit 
0 1 IZ,| ° 
Also, from Eq. (5.38) and Eq. (5.39), we get 
yTy = [Ar}’[Ar] = rfaATAr = 7558, 


and 


(y — d)’ A"* (y — d) = (Ar — As)" A) (Ar — As) 


= (r—s) A7A'A(r—s8) = (r—s)' 5,1 (r-8). 


Substituting Eqs. (5.99), (5.100) and (5.101) in Eq. (5.98), yields (5.29). 


5.10.2. Derivation of Eq. (5.51) 


Starting from Eq. (5.50), we derive Eq. (5.51) as follows : 


(y,—d,)° 
d, } 


1 1 
hy) = —gind, + y= 


Now, Eq. (5.46) can be written as 


(0) — [expt sahly) ) alv/%) dy. 


From Eq. (5.102) and Eq. (5.103), we get 


7 2 2 

roe) . (y,—d,) y 
WY. 5 "| 1 1 

¢o{¥) = [ exp |= {y < —Ind } | —— exp(— =) dy, 
-_ a d, y Jan 2 

; Qywd.y god? gw ind 

o 61 1 2 . yp hd | l J 
oy en re es en eek esos | Be 
-0 Jon ” ate d, , 2d, 2 
22 
1 boy 


———————=_¢*X [-;——-5* | 
(1—jua,)*/? PL 2-yoa) 2d 


where 


(5.99) 


(5.100) 


(5.101) 


(5.102) 


(5.103) 


(5.104) 
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qd, 
b = ame 
0! : 
», 
2 
bo, 
hy = — + Id. 


0l 


The absolute value and argument of ¢,{w) can then be calculated as 


i b Pu 
0 
It) = ——, exp | -——— |, 
(1+wa,7)"/ : 2(1 +w*a, 7) 
b 2.2 


Zo, 


o @ 
|: 


ee 1 
29 {w) = 5 tan (an) — 54 | hy + ia, 


Similarly, 


ju Ind, 


(y,—d,) 
ju 7% 1 
6,fo) = | exp | = {47 — x —1n),}] = 
T 
heats 
ol 1 , 
= exp [= so fu? (1 — 50d, + 4s) — 2yjd(145) } 
OO 2rd i 
jud, 
2 


1 
Tren eed coc 


where 


The absolute value and argument of ¢, {w) can be calculated as 


2 


(5.105) 


(5.106) 


(5.107) 


(5.108) 
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We can combine Eq. (5.106) and Eq. (5.109), to get Eq. (5.51). 


5.10.3. Derivation of Eq. (5.57) 


Starting from Eq. (5.54), we get 


[ ¥ p(h/M,) dh = [ ‘ dh ~ : : $(w) exp(—juh) duo. 


= ~ [ : $ (u) [ / egonahaa: 


From a table of Fourier transforms, we get 


_ j 
[xp -sah) dh = exp(—soF) Hu) +2). 


Substituting Eq. (5.111) in Eq. (5.110) we get 


H ¢ {0) j fe ¢ {w) 
fey dh = a + ri Pars exp(—jwH) dw 


5.10.4. Derivation of Eqs. (5.64) to (5.67) 


We begin by first establishing some simple equalities shown below. 


N N 
Ely” y/X,} = >> Elyp/¥,} = Do 1 = wf}. 
jax] la=] 


(5.109) 


(5.110) 


(5.111) 


(5.112) 


(5.113) 
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N N 
E{(y—d)” (y—d)/¥,} = >> EX(y,-4)?/¥,} = Dod, = trl}. (5.114) 
le} l=} 


N N 
E{y™ A y/N.} = S> Ely? "/M} = Soap? = wf{a}}. (5.115) 
l=] l=1 
N 
E{(y—d)” A“ (y—d)/¥,} = . E{(y—d)? \7/¥,} = Yo dA? = tefl}. (5.116) 
j==} 


| N 
E{(y7 y)?/H,} = S> E{y'/,} + > > Ely? y_?/Xo} 
j=] 


l=1 m=1, msl 


N N N 
=3y}o1+>0 YO VP = 2Qer{I} + [tr{t}]. (5.117) 
bux] l=1 m=x=1, m x | 


N N N 
E{(y? At y)/Mo} = So Ey 7M} + Ely */ Mo} 


b=} l=1 m=1,m¥l 


= >» > + > > a Veli 


luc] m=l,m xl 


= 2tr{A-7} + [tr{a}]?. - (5.118) 


N N oN 
EX(y7 y) (y7 A y)/¥} = > Ey 1/8} + D> DD Efy,2y7d,, 1/ No} 
la] 


l=] m=1,ml 


N NN 
ee ee 2 


d==] l=1 m=1,m él 


= 2tr{A} + tr{I} tr{A7)}. (5.119) 


Also, we know that because y, 8 are Gaussian random variables, all third moments are zero. Therefore 


E{(y7 y) y/X,} = 0, (5.120) 

E{(y7 y) y7/H,} = 07. (5.121) 
Similarly 

EX{ (y—d)" (y—d) ]°/¥,} = 2tr{4?} + [er{ap]’, (5.122) 

E{[ (y—d)" A? (y—d) J°/H,} = 2tr{I} + [ofp], (5.123) 
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E{[ (y—d)” (y—d) } [ (y—d)? A7? (y—d) J/¥,} = 2 tr{A} + tr{A} tr{I}, (5.124) 
E{(y—d)" (y—d) (y—d)/¥,} = 0, | (5.125) 
EX{(y—d)” (y-d) (y—d)"/H,} = 07. (5.126) 
Using Eqs. (5.113) to (5.126) we can compute E{h/X,}, E{h/H,}, Var{h/X,} and Var{h/H,} as follows. 

E(h/W) = —Sln{lAl} + 5 Ety7y/M) — 5 Ely AT ¥/M,) 

es 
+ Ely! A") d/¥,} - sd Ald 
= — sin {|A]} + = t(1) — = {A} — =a? A d 


om — sin {JAl} + = tr{l— 471} _ =a? A d. (5.127) 


E{h/¥) = — 510 (lal) + 5 El(y—a)? (y-a)/¥,} 
+ Ely" a/¥,) — 5 Ely)" A (y-a)/n,) — 27a 
1 | 1 lor 
= — 5 n{la}} + 5 tA} _ 9 HD + 34 d 


1 1 L 
= —Fin{lal) + Sue(A-1) + 57a. (5.128) 


Var(/¥,) = EtLsy7y - aya y +y' Ald - = {I - £71} 1°/¥} 
= LEiyT y)"/M) + 2 EU? A? ¥)?/%} 
+ E{a? Ay y7 A) d/X,} + = [ur{t— 274} ? 
- = Ely y yA y/H,} + E{y7y y7 A) d/H,} 
— Efy7 Ay y™ A a/X} 


1 1 ie “a 
— tr{I —A71} E{sy'y-sy' A ly+y7A 1 a/X, } 
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= - [ 2 tr{I} + tr{I}?} + ; [ 2 tr{A-7} + tr{A-7}? J 
+ @a’a — : [ 2 tr{A} + tr{I} tr{A7!} ] 
+ Lez yy - eG ryt peg- 2) 


1 
= 5u{a- A” )?} + a? a? d (5.129) 


Var{h/¥,} = EXL : (y—d)"(y—d) + (y-d)"a — {y—a)ANy—d) — 5 {A —1))?/%,) 
= = E{L (yall (ya) 2/%} + E(AT (a) (ya)? 4/%,) 


+ SEL a" 7 (ya) 727) + Se — 
+ E{(y—d)" (y—d) (yd)? d/X,} 


— E{(y—d)" A” (y—d) (y—d)" d/H,} | 

5 Eliya)? (yd) (ya)? A (y—a)/,} 

— t(AT) [ El fy-a)y—a) + (y—a)Pa — 5 (yay y—a)/%,} J 

= - [2 tr{A’} + (tr{4})*] + - [ 2 tr{I} + (tr{I})?] 
+ dAd — - [ 2 tr{A} + tr{A} tr{I} ] 
1 1 1 : 
— tr{A-—T} [5 tt4} — 5 tril} ) + z fils — I} ] 
= = tr{(A —I)*} + d7Ad. (5.130) 

If we denote E{h/X}, E{h/H,}, Var{h/N,} and Var{h/X,} by no, ,, oo" and o,” respectively we can 
rewrite Eqs. (5.127) to (5.130) as 


No rs n{|A]} + - tr{I — A} = d ( ) 


83 


1 1 1 
mn = —5in{ldl} + 5e4- + said. 


1 
7 = 5 r{(i-4')} + dAa?*d 


1 
o7? = 5 tr{(A — 1)"} + d?Ad. 


1 


These results are used in Chapter 5 to analyze the various bit detection schemes. 


(5.132) 


(5.133) 


(5.134) 
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Level of 


Probability of bit error 


erasure (dB) Chernoff Approximate Exact 
bound calculation calculation 
0 10°34” < 10% a 
-2.5 10°22” < 10° a 
-6.0 10:7" < 10% a 
-12.0 7.0661 X 10° 1.7235 x 10°? = 
-20.0 4.4066 X 10” 7.0203 x 107 6.9444 xX 10” 
-22.5 7.0777 xX 10% 1.4607 x 10% 1.4601 x 10% 
-26.0 2.6970 X 10 7.8343 x 10° 7.8342 x 10° 
-32.0 0.24081 0.11338 0.11334 
-40.0 0.44483 0.31441 0.31440 
-60.0 0.49943 0.48205 0.48205 
-80.0 0.499994 0.49828 0.49828 


Table 5-1: Optimal detection, a.c. erasure, Low modulation level (r, = 107, i= 10° ) 


Level of Probability of bit error 


_ erasure (dB) Chernoff Approximate Exact 
bound calculation calculation 
0 2.5414 x 10°)? 1.1359 x 10° * : sn 
-2.5 5.2029 x 10°)” 1.2765 x 10°” * — 
-6.0 2.1059 x 107)! 1.7533 x 107 * = 
-12.0 2.6865 x 10° 7.5309 x 10°” * 7.8964 x 107? 
-20.0 1.6278 x 104 1.1919 x 10% 2.4194 x 10° 
~22.5 2.6665 X 10° 8.7449 x 104 5.7445 x 104 
-26.0 3.6262 x 10° 1.1185 x 10° 1.0191 x 10° 
-32.0 0.24569 0.11657 0.11659 
~40.0 0.44506 0.31472 0.31472 
-60.0 0.49947 0.48072 0.48071 
-80.0 0.49994 0.49820 0.49820 


Table 5-2: Optimal detection, a.c. erasure, High modulation level (rp = 107, = 10°? ) 


Level of 


Probability of bit error 


erasure (dB) Chernoff Approximate Exact 
bound calculation calculation 
0 10°°48 < 10% = 
-2.5 10°28 < 10° — 
-6.0 10°12 < 10° hs 
~12.0 4.0379 x 10°! 3.9188 x 10°” a 
-20.0 6.1710 x 10° 9.2907 x 10” 8.2775 X 10” 
-22.5 8.5996 x 10% 1.8227 x 104 1.7785 x 104 
-26.0 2.9479 x 10° 9.6126 x 10° 8.6484 x 10° 
-32.0 0.24636 0.12386 0.11689 
-40.0 0.44648 0.31944 0.31702 
-60.0 0.49943 0.48122 0.48100 
-80.0 0.499994 0.49821 0.49810 


Table 5-3: Optimal detection, d.c. erasure, Low modulation level (r, ="107, = 10° ) 


Level of Probability of bit error 


erasure (dB) Chernoff Approximate Exact 
bound calculation calculation 
0 9.8698 x 10°72? 1.7514 x 10° * | a 
-2.5 2.9405 x 107} 2.6323 x 10° * a 
-6.0 3.9133 x 107° 5.1615 x 107 * = 
-12.0 1.0858 x 10° 6.9187 x 10° * 3.8334 x 10° 
-20.0 1.5532 x 107 3.7953 x 10° 2.5685 x 10° 
-22.5 6.4021 x 10° 1.5229 x 107 1.47280 x 107 
-26.0 0.19418 5.2718 x 107 5.2480 X 10? 
-32.0 0.39316 0.21586 0.21584 
-40.0 0.48112 0.37728 0.37726 
-60.0 0.49981 0.48781 0.48780 
-80.0 0.499998 0.49878 0.49878 


Table 5-4: Optimal detection, d.c. erasure, High modulation level (r) om 10°, r= 10°? ) 


Level of Probability of bit error 


erasure (dB) Best Linear Correlator 
0 < 10° < 10° 
-2.5 < 10° < 10% 
-6.0 < 10 ~ <10% 
-12.0 6.5450 x 10°° 1.7589 x 10°? 
-20.0 7.0279 X 10° 7.0476 X 10° 
~22.5 1.4610 x 104 1.4635 x 104 
-26.0 7.8342 x 10° 7.8407 x 10° 
-82.0 0.11337 0.11340 
-40.0 0.31434 0.31436 
-60.0 0.48071 0.48072 
-80.0 0.49807 0.49807 


Table 5-5: Linear detection, a.c. erasure, Low modulation level (r, == 10°, i= 104 ) 


Level of 


Probability of bit error 


erasure (dB) Best Linear Correlator 
0 4.0671 x 10°? 1.3657 x 10° 
-2.5 6.4937 x 10°19 1.5210 x 107 
-6.0 1.3445 x 1076 2.0498 x 10” 
-12.0 8.9862 x 10°)? 8.3364 x 107 
-20.0 2.5211 x 10° 1.2197 «x 104 
-22.5 5.7613 < 10% 8.8454 x 104 
-26.0 1.0917 x 10° 1.1228 x 107 
-32.0 0.11661 0.11664 
-40.0 0.31473 0.31475 
-60.0 0.48071 0.48071 
-80.0 0.49807 0.49807 


Table 5-6: Linear detection, a.c. erasure, High modulation level (ro = 10°, = 10°? ) 


Level of 


Probability of bit error 


erasure (dB) Best Linear Correlator 
0 < 10% < 10% 
-2.5 < 10% < 10°° 
-6.0 < 10% < 10% 
~12.0 5.6489 x 10°? 6.1655 x 10°? 
-20.0 9.9620 x 10° 1.0033 x 10° 
~22.5 1.7986 x 104 1.8063 x 10° 
-26.0 8.6687 x 10° 8.6867 x 10° 
-32.0 0.11707 0.11714 
-40.0 0.31709 0.31714 
-60.0 0.48102 0.48103 
-80.0 0.49810 0.49810 


Table 5-7: Linear detection, d.c. erasure, Low modulation level (r) = 10°, = 10% ) 


Level of 


Probability of bit error 


erasure (dB) Best Linear Correlator 
0 1.8388 x 10°)” 1.4856 x 10” 
-2.5 8.1583 x 107° 1.6005 x 10° 
-6.0 5.1764 x 107% 3.1346 X 10° 
-12.0 8.3227 x 10° 9.8230 x 10° 
-20.0 4.2084 x 10° 1.1268 x 10° 
-22.5 2.1392 x 10° 4.1303 x 10° 
-26.0 8.4699 x 10° 0.12181 
-32.0 0.24421 0.27981 
~40.0 0.39080 0.40801 
-60.0 0.48896 0.49074 
-80.0 0.49889 0.49907 


Table 5-8: Linear detection, d.c. erasure, High modulation level (ro = 10°, = 10°? ) 


Level of Probability of bit error 


erasure (dB) Optimal (exact) Best Linear Correlator 
-20.0 6.9444 x 10° 7.0279 x 107 7.0476 X 10° 
-22.5 1.4601 x 10% 1.4610 x 104 1.4635 x 104 
-26.0 7.8342 x 10° 7.8342 x 10° 7.8407 x 10° 
-32.0 0.11334 0.11337 0.11340 
-40.0 0.31440 0.31434 0.31436 
-60.0 0.48205 0.48071 0.48072 
-80.0 0.49828 0.49807 0.49807 


Table 5-9: Low modulation level ( r,) = 107, i= 104 ), @.c. erasure 


Level of Probability of bit error 
erasure (dB) Optimal (exact) Best Linear Correlator 
-12.0 7.8964 x 10°)? 8.9862 x 107)? 8.3364 x 107° 
-20.0 2.4194 x 10° 2.5211 x 10° 1.2197 x 10% 
-22.5 5.7445 x 10% 5.7613 X 10% 8.8454 x 10% 
-26.0 1.0191 x 107 1.0917 x 107 1.1228 x 10° 
-32.0 0.11659 0.11661 0.11664 
-40.0 0.31472 0.31473 0.31475 
-60.0 0.48071 0.48071 0.48071 
-80.0 0.49820 0.49807 0.49807 


Table 5-10: High modulation level ( i = 10°, r= 10°? ), &.c. erasure 


Level of 


Probability of bit error 


erasure (dB) Optimal (exact) Best Linear Correlator 
-20.0 8.2775 x 10° 9.9620 x 10°” 1.0033 x 10° 
-22.5 1.7785 x 107% 1.7986 X 10% 1.8063 x 10% 
--26.0 8.6484 x 10° 8.6687 x 10° 8.6867 10° 
-32.0 0.11689 0.11707 0.11714 
-40.0 0.31702 0.31709 0.31714 
-60.0 0.48100 0.48102 0.48103 
-80.0 0.49810 0.49810 0.49810 


Table 5-11: Low modulation level ( r, = 10°, i= 10% ), dc. erasure 


Level of Probability of bit error 
erasure (dB) Optimal (exact) Best Linear Correlator 
-12.0 3.8334 x 10° 8.3227 x 10° 9.8230x 10° 
-20.0 2.5685 x 10° 4.2084 x 10° 1.1268 x 10 
-22.5 1.47280 x 10° 2.1392 X 107 4.1303 X 107 
-26.0 5.2480 x 10°? 8.4699 X 10° 0.12181 
-32.0 0.21584 0.24421 0.27981 
-40.0 0.37726 0.39080 0.40801 
-60.0 0.48780 0.48896 0.49074 
-80.0 0.49878 0.49889 0.49907 


Table 5-12: High modulation level ( r= 107,r, = 10° ), d.c. erasure 


Chapter 6 


Conclusions and Discussion 


6.1. Summary and Conclusions 


In Chapter 2, we showed that the Lorentzian pulse model for the signal was a reasonable approximation. 
Then, in Chapter 3, we derived an expression for the average power spectrum of the noise from 
particulate recording media. We showed that the average power spectrum of the noise essentially 
exhibits a background noise term which is independent of the signal, and a modulation noise term which 
depends on the signal. Most importantly, we showed that even though the noise power depends on the 


writing frequency, the signal-to-noise ratio is independent of the signal frequency. 


In terms of modeling, the most significant contribution made in the thesis was the time-domain model 
for particulate media noise. This model allows us to obtain the two-dimensional] autocorrelation function 
of media noise from simple spectrum analyzer measurements. This method for obtaining the 
autocorrelation function of the media noise is better than the method of time averages suggested by 
Tang [16] in two ways. Firstly, time averages are much more complicated than spectrum analyzer 
measurements and are prone to timing errors. Secondly, using the autocorrelation functions R,, () and 
R,, (7 defined in Chapter 4, we can determine the two-dimensional noise autocorrelation function for 


any general signal written on the disc. 


From the results of the numerical evaluation of error probabilities for the detection schemes in Chapter 
5, we observed that bit error rates for d.c. erased signals are marginally higher than those for a.c. 
erased signals. This might lead one to suppose that d.c. erasure is a more effective way to ensure the 
security of information written on the disc. But it has been shown that d.c. fields induce significantly 
smaller erasure than a.c. fields of the same amplitude [10]. Hence, in most practical cases using a.c. 


fields for erasure would be a better choice than d.c. fields. 


In Chapter 5, we also saw that modulation noise does not play a significant role at high levels of 
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erasure. This means that for the specific problem of interest, namely, recovering information from 
erased discs, we could have used a simple band-limited white background noise model to represent the | 
noise. But this fact should not diminish the utility of the noise model that we developed in Chapter 4 


because in most other applications the signals are not erased. 


6.2. Some Comments About Simplifying Assumptions Made in This Thesis 


In Chapter 2, we approximated the flux waveform corresponding to an arctangent transition in 
magnetization by an arctangent of larger width. This approximation yields the Lorentzian signal pulse 
shape. We could perform a more detailed analysis to obtain an exact formulation for the pulse shape. 
But we see in Chapter 5 that in designing bit detection schemes the pulse shape was not as critical as 


total energy in the pulse. So as long as the Lorentzian model has the correct total energy we do not need 
a more sophisticated model. 


In Chapter 4, we made the assumption that media noise is Gaussian. This simplified all the ensuing 
analysis in Chapters 4 and 5. Even though this assumption has a strong basis, we could pursue all the 
analysis that we did in these chapters without making the Gaussian assumption because it is possible to 
obtain the N-th order statistics of the noise from simple particle interaction models [2]. We do not 
expect that the resulting error probability estimates will be significantly different. 


In designing the detection schemes discussed in Chapter 5, we made the implicit assumption that the 
sign of the pulse is known to the detector; otherwise, under hypothesis x, the expected value of the 
readback voltage vector r would be two valued. The justification for making this assumption is that 
positive and negative pulses occur alternately in a readback signal; and, hence, the choice of the sign of 
the pulse in the bit period of detection would be based on sign of the previous pulse. This procedure 
would result in propagation of errors when a sequence of bits is being detected. There are two ways of 
getting around this problem. One is to find a suitable encoding scheme for writing the bits on the disc 
which will limit the extent to which errors are allowed to propagate. The second is to redesign the 
detection scheme to accommodate the two-valued nature of E{r/H,}. 


In Chapter 5, we modeled the background noise by a bandlimited white noise term with a narrow 
bandwidth compared to the background noise. We could improve this mode] by considering the 
modulation noise to be the sum of a very narrow band noise term which represents noise due to surface 


asperities, and a narrow band noise term which represents the noise due to particle clustering. 
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6.3. Suggestions for Future Work 


We did not compare the performance of the conventional peak detection scheme with the schemes 
discussed in this report. Since the peak detection scheme is ad hoc, the only way to compare its 
performance with these schemes is by computer simulation or experimentation on rea] data. These 


simulations and experimentation can also be used to validate the results that we obtained by numerical 


evaluation. 


Another issue that we have not addressed in this thesis is the effect of RLL ( run length limited ) coding 


and error correcting coding on the bit error rates and sequence error rates. These are important when we 


consider high density recording. 


One simple way to estimate the sequence of bits that is stored on the disc is to put together the 
individual bit estimates that we obtain from bit detection schemes. Unfortunately this is not optimum 
when we have intersymbol interference. High density magnetic recording channels suffer from very high 


levels of intersymbol interference [12]. Hence, we need to consider other more sophisticated sequence 


estimation techniques. 


A recent paper by Duel-Hallen and Heegard [7] discusses the application of a new signal processing 
algorithm for sequence estimation in the presence of high intersymbol interference, called Delayed 
Decision Feedback Sequence Estimation ( DDFSE ), in digital magnetic recording channels. The noise 
mode] that is used in the analysis of this paper is the standard additive white Gaussian noise ( AWGN ) 
model. This analysis could be made much more accurate by using the nonstationary noise model that we 


proposed in this thesis, and it opens an area for prospective future work in this field. 
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